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Abstract 



In this paper we study the impact of the introduction of edge weights on the typical 
distances in a random graph and, in particular, on its diameter. Our main result consists of 
a precise asymptotic expression for the maximal weight of the shortest weight paths between 
a random vertex and all other vertices (the flooding time), as well as the (weighted) diameter 
of sparse random graphs, when the edge weights are i.i.d. exponential random variables. 
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1 Introduction and main results 

A weighted graph (G, w) is the data of a graph G = (V, E) and a collection of weights w = 
{we}e(^E associated to each edge e & E. We suppose that all the edge weights are non-negative. 
For two vertices a and b G V, a path between a and 6 is a sequence vr = (ei, 62, • • • e^) where 
e, = {vi-i,Vi} G E and Vi € V for i € {1, . . . , k} =: [1, k], with vq = a and Vk = b. We write 
e G vr if the edge e £ E belongs to the path vr, i.e., if e = Cj for an i G [1, k]. For a,b £ V, the 
weighted distance between a and b is given by 



where the minimum is taken over all the paths between a and b in the graph. The weighted 
diameter is then given by 




max{distt„(a, 5), a,b £V, dist^(a, 5) < 00} 



while the weighted flooding time is 



floodt„(G) = max{distt„(a, 6), b £ V, dist^(a, 6) < 00} 



where a is chosen uniformly at random in V. 
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Configuration model. For n G N, let (dj)" be a sequence of non-negative integers such 
that X^iLi di is even. By means of the configuration model [7], we define a random multigraph 
with given degree sequence {di)i, denoted by G*{n, {di)^) as follows: to each node i S [l,n] 
we associate di labeled half-edges. All half-edges need to be paired to construct the graph, 
this is done by randomly matching them. When a half-edge of i is paired with a half-edge of 
j, we interpret this as an edge between i and j. The graph G* {n, {di)1) obtained following 
this procedure may not be simple, i.e., may contain self- loops due to the pairing of two half- 
edges of i, and multi-edges due to the existence of more than one pairing between two given 
nodes. Conditional on the multigraph G*{n, (di)") being a simple graph, we obtain a uniformly 
distributed random graph with the given degree sequence, which we denote by (dj)"), 
|23| . We consider asymptotics as the numbers of vertices tend to infinity, and thus we assume 
throughout the paper that we are given, for each n, a sequence d*-"-* = (dj-"^)" = (di)i of non- 
negative integers such that X^^^ dj"'* is even. For notational simplicity we will sometimes not 
show the dependency on n explicitly. 

For /c G N, let n^""* = di = k}\ be the number of vertices of degree k. From now on, 
we assume that the sequence (di)" satisfies the following regularity conditions analogous to the 
ones introduced in [28] : 

Condition 1.1. For each n, d^") = (d^"^)" = (di)i is a sequence of positive integers such that 
Z^r=i even and, for some probability distribution {pr)^o over integers independent of n 
and with finite mean fi := Yl,k>o^Pk £ (0, oo), the following holds: 

(i) u^!^ jn — )• pfc for every > 1 as ?i — )• oo; 

(ii) For some e > 0, ^^=1 '^^^ = 0{n). 

Note that the condition di > 1 for all i, is not restrictive since removing all isolated vertices 
from a graph will not affect its (weighted) diameter or flooding time. 

Remark 1.2. The results of this work can be applied to some other random graphs models by 
conditioning on the degree sequence. In particular, our results will apply whenever the random 
graph conditioned on the degree sequence has a uniform distribution over all possibilities. 
Notable examples of such graphs are G{n,p), the Bernoulli random graph with n vertices and 
edge probability p and G{n, m), the uniformly random graph with n vertices and m edges. For 
example, for G{n,p) with np ^ fi £ (0,oo) or G{n,m) with 2m/n — )• fi, the Condition ll.l( i) 
holds in probability with (p^) a Poisson distribution with parameter ^, p^ = e~^^. In section 
[5l we show that thanks to Skorohod coupling theorem ([251 Theorem 3.30]) our results still 
apply in this setting. 

Main results. We define q = {qk}^o the size-biased probability mass function corresponding 
to p, by 

(/j _|_ i^p °° 
\/k > 0, Qk '■= ,and let denote its mean: u = kq^ G (0, oo). (1) 
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The condition v > 1 is equivalent to the existence of a giant component in the configuration 
model, the size of which is proportional to n (see e.g., \23\ I28j). We will assume that > 1 in 
the rest of the paper. 

Let (l)p{z) be the probability generating function of {pk}'kLo- 4^p{z) = '}^'k=oPk^^ ^ let 
(j)q{z) be the probability generating function of {gfej^g- "^gC-^) ~ Sfclo ^^-^^^ ~ 

We denote by A the smallest solution in [0, 1] of the fixed point equation: A = 0g(A). In 
addition, we introduce 

oo 

A* = <(A) = J^fcgfcA^-^ (2) 

fc=i 

The interpretation of the parameters A and A* in term of a branching process is given in 
Appendix lA.li 

We can now state our main theorem. 

Theorem 1.3. Consider a sequence of random weighted graphs {G{n, (di)i),w) where w = 
{we}eeE o,re i.i.d. rate one exponential random variables. Assume Condition 11.11 and that v 
defined in ([7P is such that u > 1. Assume that all the graphs have the same minimum degree 
denoted by d^i^ = min,jg[x „] di and moreover that Pd^^i^ > 0. Let F : N — )• N be defined by: 

T{d) := dl[d > 3] + 2(1 - qi) l[d = 2] + (1 - A*) l[d = 1]. (3) 

We have 



diam^(G(n, (d,)^)) ^ _J_ ^ 2 floods (G(n, (d,)?)) ^ _J_ ^ 1 

logn l^-l r((imin)' ' logn U-l T{dram)' 

The decomposition of each limit for the weighted diameter and the flooding time in two 
terms with a common factor has a simple interpretation. We now present it with an 
overview of the proof. The first key idea of the proof is to grow balls centered at all vertices of 
the graph simultaneously. The time when two balls centered at a and b respectively intersect is 
exactly the half of the weighted distance between a and b. In what follows, we will sometimes 
deliberately use the term time instead of the term weighted distance. Hence the weighted 
diameter becomes twice the time when the last two balls intersect and the flooding time becomes 
twice the time when the last ball intersects the ball centered at an uniformly chosen vertex. 
A simple argument shows that any two balls containing (slightly more than ) -y/n vertices will 
intersect (with high probability) as shown by Proposition l3.11 Hence it will be enough to control 
the time at which all balls have reached this critical size of order ^/n in order to prove an upper 
bound for the weighted diameter and the flooding time. In order to do so, we apply an union 
bound argument so that we need to find the right time such that the probability that a ball 
(taken uniformly at random) reaches at that time a size at most ^/n is of order n~^. In order 
to do so, we use the second main idea of the proof: we couple the exploration process on the 
weighted graph with a continuous time Markov branching process. It is simple to guess which 
branching process approximates the exploration process on the graph. The main difficulty here 
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is that since we are interested in events of small probability for this exploration process (of 
order we need to show that the coupling introduces errors on the probabilities which 

are negligible in comparison to n~^. Considering graphs with dmin = 1 introduces additional 
technical difficulties since in this case, the 'giant component' of the graph contains only a 
fraction of the nodes. 

It is instructive to take this coupling as granted and to consider now the large deviation 
problem for the continuous time Markov process. We consider a general one dimensional 
continuous time Markov branching process {Z(t)]t > 0} with infinitesimal generating function 
f{s)—s, where f{s) = g^s* with > and % = 1 (see e.g., [2]). We recall the construction 
of the process {Z{t)} and introduce some required notations. Let > 1} be a sequence of 
i.i.d. random variables with generating function / and let r]i = ^i — 1. We define Sq = Z{0) = k 
for k > 1, and S'^ = k + ^"^^ i]i for n > 1. Let I := inf{n; = 0} > 1. If 5^ / for ah 
n, then / = oo. Given the sequence {^j}, let , ■ ■ ■ ,Tj be mutually independent exponential 
random variables with means 



E 



I {^^} 
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We define the sequence of split times by Tq = and = rf^ + • • • + for 1 < n < /, and let 

_ / for T^-i <t<Tll<n<I, 

^^*^-\0 foi Tf<t. 

On the event / < oo, we take the convention = +oo for any n > / + 1. This event 
corresponds to the extinction of the branching process. Let A denote its probability which is 
the smallest root in [0, 1] to s = f{s). We also define A* = /'(A). 

Theorem 1.4 (PQ). In previous setting with 1 < /'(I) < oo, we denote (,min = > 0}. 

For any x > and k > 1, we have 



with 



lim -^logP ( oo > Tli >(x + —\ — - ) logn ) = -xg{Um,k), 



giUin, k) = kl (6nm > 2) + k{l - qi)l {Uin = 1) + (1 " A,)l (Uin = 0) . 



(4) 



The proof of this theorem can be found in pjj. Coming back to the branching process 
approximating the exploration process, we should consider the case where k is the degree of 
the center of the ball hence a random variable with distribution p = {pk} and the distribution 
q = {qk} is given by ([T]). Now k being random, the main contribution in the large deviation Q 
will be given by the term with minimal possible value of k, i.e., dmin- However ^min being the 
minimal offspring, we should have ^min = dmin — 1- The fact that T{dram) = gidmin — 1> c?min) can 
now be understood easily: it follows from that at time 2(^.^1) + r(d^ ) exploration process 
reaches size of order ^/n with probability of order n^^. By previous (heuristic) argument, we 
know that this is half the value of the diameter. 
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Outline. In the next section, we consider the exploration process for configuration model 
which consists in growing balls simultaneously from each vertex. The diameter will be the 
time the last pair of balls intersect. A precise treatment of the exploration process, resulting 
in information about the growth rates of the balls are given in this section. In addition, the 
section provides some necessary notations and definitions that will be used throughout the last 
three sections. Sections [3] and H] form the heart of the proof. We first prove that the above 
bound is an upper bound for the weighted diameter. This will consist in defining the two 
parameters and Pn with the following significance, {i) Two balls of size at least /3„ intersect 
almost surely, [ii) considering the growing balls centered at a vertex in the graph, the time it 
takes for the balls to go from size an to size /3„ have all the same asymptotic for all the vertices 
of the graph, and the asymptotic is half of the typical weighted distance in the graph, and [iii) 
the time it takes for the growing balls centered at a given vertex to reach size at least is 
upper bounded by (1 + e)r((imin) log n for all e > with high probability (w.h.p.). This will 
show that the diameter is w.h.p. bounded above by (1 + + 2T{d^\^)) logn, for all e > 0. 

The last section provides the corresponding lower bound. To obtain the lower bound, we show 
that w.h.p. {iv) there are at least two nodes with degree drain such that the time it takes for 
the balls centered at these vertices to achieve size at least a„ is worst than the other vertices, 
and is lower bounded by (1 — e)r(dinin) log n, for all e > 0. And using this, we conclude that the 
diameter is w.h.p. bounded below by (1 — e){-^^ + 2T{dimn)) log n, for all fixed e > 0, finishing 
the proof of our main theorem. The actual values of and /3„ will be 



In Section [5] we show that our results still apply for random graphs G{n,p) and G{n, m) by 
conditioning on the degree sequence. When dmin = the longest shortest path in a random 
graph will be between a pair of vertices a and b of degree one. Furthermore, this path consists 
of a path from a to the 2-core, a path through the 2-core, and a path from the 2-core to h. For 
this, we need to provide some preliminary results on the structure of the 2-core, this is done in 
Appendix lA. 21 

Related literature. The typical distance and diameter of non-weighted graphs have been 
studied by many people, for various models of random graphs. A few examples are the results of 
BoUobas and Fernandez de la Vega [8], van der Hofstad, Hooghiemstra and Van Mieghem jl8j . 
Fernholz and Ramachandran |14] . Chung and Lu [10], Bollobas, Janson and Riordan [9] and 
Riordan and Wormald |29] . 

The analysis of the asymptotics of typical distances in edge weighted graphs has received 
much interest by the statistical physics community in the context of first passage percolation 
problems. First-passage percolation (FPP) describes the dynamics of a fluid spreading within a 
random medium. This model has been mainly studied on lattices motivated by its subadditive 
property and its link to a number of other stochastic processes, see e.g., [151 [261 [16] for a more 
detailed discussion. 

First passage percolation with exponential weights has received substantial attention (see 
[31 [m [171 [13 HI [SI [201 [5])) in particular on the complete graph, and, more recently, also on 
random graphs. In [20], Janson considered the special case of the complete graph with fairly 




(5) 



5 



general i.i.d. weights on edges, including the exponential distribution with parameter one. It 
is shown that, when n goes to infinity, the asymptotic distance for two given points is log n/n, 
that the maximum distance if one point is fixed and the other varies (the flooding time) is 

2 log n/n, and the maximum distance over all pairs of points (i.e., the weighted diameter) is 

3 log n/n. He also derived asymptotic results for the corresponding number of hops or hopcount 
(the number of edges on the paths with the smallest weight). More recently, a number of 
papers provide a detailed analysis of the scaling behavior of the joint distribution of the first 
passage percolation and the corresponding hopcount for the complete graph, e.g., [171 E]- Iii 
particular, Bhamadi derives in [3], limiting distributions for the first passage percolation on 
both the complete graph and dense Erdos-Renyi random graphs with exponential and uniform 
i.i.d. weights on edges. This extends previous results by van der Hofstad et al. [T7] exploring 
the link between the flooding time and first-passage percolation for both of these graphs with 
exponential edge-weights. 

More closely related to the present work, Bhamidi, van der Hofstad, and Hooghiemstra [6] 
study first passage percolation on random graphs with finite average degree, minimum degree 
greater than two and exponential weights, and derive explicit distributional asymptotic for the 
total weight of the shortest-weight path between two uniformly chosen vertices in the network. 
Our proofs will show that the analysis made in [6] is not sufficient to obtain results for the 
diameter. Indeed, we need to use large deviation techniques to control all the vertices and 
not only the uniformly chosen ones. A lower bound of the weighted diameter in the case 
of Erdos-Renyi random graphs was given by Bhamidi, van der Hofstad and Hooghiemstra in 
[4]. In particular, our theorem improves this bound, and gives the correct asymptotic (see 
Remark II. 2p . The base of our method is an extension of that (exploration process) used by 
Ding, Kim, Lubetzky, and Peres [12] to study the weighted diameter in random regular graphs; 
the setup and proofs are significantly more involved in the present setting. 

Basic notations. For non-negative sequences Xn and y„, we describe their relative order of 
magnitude using Landau's o(.) and 0{.) notation: We write Xn = 0{yn) if there exist G N 
and C > such that Xn < Cyn for all n > N. Occasionally, we write Xn = ^{vn) to mean 
that there exists iV > and C > such that for all n > A^, x„ > Cy^. If x„ = 0{yn) and 
Xn = ^l{yn), then we write Xn = 0(yn)- We write x„ ~ y„ when x„/y.„ — )• 1 as ?i — )• oo. 
We usually do not make explicit reference to the probability space since it is usually clear to 
which one we are referring. We say that an event A holds almost surely, and we write a.s., if 
P(^) = 1. The indicator function of an event A is of particular interest, and it is denoted by 
l[A]. We consider the asymptotic case when n — )■ oo and say that an event holds w.h.p. (with 
high probability) if it holds with probability tending to 1 as n — )• oo. We denote by — and 
— >, convergence in distribution, and in probability, respectively. Similarly, we use Op and Op 
in a standard way. For example, if (Xn) is a sequence of random variables, then Xn = Op{l) 
means that "X„ is bounded in probability" and Xn = Op{n) means that Xn/nA-Q. 
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2 First Passage Percolation in G*{n, {di)^) 

We start this section by introducing some new notations and definitions. Before this, one 
remark is in order. In what fohows, we wih sometimes deliberately use the term "time" instead 
of the term "weighted distance". It will be clear from the context what we actually mean by 
this. 

Let (G = {V,E),w) be a weighted graph. For a vertex a ^ V and a real number t > 0, 
the t-radius neighborhood of a in the (weighted) graph, or the ball of radius t centered at a, is 
defined as 

(a, t) := { 6, dist^ (a, 6) < t } . 

The first time t where the ball Bw{a,t) reaches size fc + 1 will be denoted by Ta{k) for k>Q, 
i.e. 

Ta{k) = min 

If there is no such t, i.e. if the component containing a has size at most we define Ta{k) = oo. 
More precisely, we use la to denote the size of the component containing a in the graph minus 
one, in other words, 

la •■= max{ \Bw{a,t)\,t > } - 1, 

so that for all k > la, we set Ta{k) = oo. Note that there is a vertex in By^{a,Ta{k)) which is 
not in any ball of smaller radius around a. When the weights are i.i.d. according to a random 
variable with continuous density, this vertex is in addition unique with probability one. We will 
assume this in what follows. For an integer i < /ai we use da{i) to denote the forward-degree 
of the (unique) node added at time Ta{i) in B^^a, Ta{i)). Recall that the forward-degree is the 
degree minus one. Define Sa{i) as follows. 

Sa{i).= da + da{l) + ...+da{i)-i, Sa{0) = da. (6) 

For a connected graph H, the tree excess of H is denoted by tx{H), which is the maximum 
number of edges that can be deleted from H while still keeping it connected. By an abuse of 
notation, for a subset W C V, we denote by tx(VF) the tree excess of the induced subgraph 
G[W] of G on W. (If G[W] is not connected, then tx(IF) := oo.) Consider the growing balls 
Byj{a,Ta{i)) for < i < /a centered at a and let Xa{i) be the tree excess of By^{a,Ta{i)): 

Xa{i) ■.= tx{B^{a,Ta{i))). 

We extend the definition of Xa to all the integer values by setting Xa{i) = Xa{Ia) for all i > la- 
The number of edges crossing the boundary of the ball By,{a,Ta{i)) is denoted by Sa{i)- A 
simple calculation shows that 

Sa{i) = Sa{€) - 2Xa{i). (7) 

We now consider a random graph G{n, {di)^) with i.i.d. rate one exponential weights on 
its edges, such that the degree sequence (dj)" satisfies Condition ll.il We let m^") be the total 
degree defined by 

n 

i=l A:>0 
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One particularly useful property of the configuration model is that it allows one to construct 
the graph gradually, exposing the edges of the perfect matching one at a time. This way, each 
additional edge is uniformly distributed among all possible edges on the remaining (unmatched) 
half-edges. We have the following useful lemma. 

Lemma 2.1. For any k < ™ 2~" ' '"'^ have 

P (2Xa{k) > X I Sa{k)Ja > A:) < P (^Bin {Sa{k), \ISa{k)/i^ > X I Sa{k)^ . 
A proof of this lemma is given in Section IX3l 

In the sequel, we will also need to consider the number of vertices of forward-degree at least 
two in the (growing) balls centered at a vertex a . Thus, for i < la, define 

i 

7a(i) := J^lK(^)>2] = |{6GS^(o,T,(i)) : 6/aand4>3}|, (8) 
e=i 

and extend the definition to all integers by setting 7a (i) = ^a{Ia) for all i > la- Note that 
7a (0) = and 7q_(z) = i if dmin > 3. 

Now define Ta{k) to be the first time where the ball centered at a has at least k nodes of 
forward-degree at least two. More precisely, 

Ta{i) ■■= min|ra(^), for ^ such that 7a (^) > (9) 

The main idea of the proof of Theorem 11.31 consists in growing the balls around each vertex 
of the graph simultaneously so that the diameter becomes equal to twice the time when the last 
two balls intersect. In what follows, instead of taking a graph at random and then analyzing 
the balls, we use a standard coupling argument in random graph theory which allows to build 
the balls and the graph at the same time. We present this coupling in the next coming section. 

2.1 The exploration process 

Fix a vertex a in G*{n, (dj)"), and consider the following continuous-time exploration process. 
At time t = 0, we have a neighborhood consisting only of a, and for t > 0, the neighborhood 
is precisely Bw{a,t). We now give an equivalent description of this process. This provides a 
more convenient way for analyzing the random variables which are crucial in our argument, 
e.g., Sa{k). The idea is that instead of taking a graph at random and then analyzing the 
balls, the graph and the balls are built at the same time. We will consider a growing set of 
vertices denoted by B and a list L of yet unmatched half-edges in B. Recall that in the usual 
way of constructing a random graph with given degree sequence, we match half-edges amongst 
themselves uniformly at random. In the following, by a matching, we mean a pair of matched 
half-edges. 

• Start with B = {a}, where a has da half-edges. For each half edge, decide (at random 
depending on the previous choices) if the half-edge is matched to a half-edge adjacent 
to a or not. Reveal the matchings consisting of those half-edges adjacent to a which are 
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connected amongst themselves (creating self-loops at a) and assign weights independently 
at random to these edges. The remaining unmatched half-edges adjacent to a are stored 
in a list L. (See the next step including a more precise description of this first step.) 

• Repeat the following exploration step as long as the list L is not empty. 

• Given there are i > 1 half-edges in the current list, say L = {hi, . . . , hi), let ^ ~ Exp(£) be 
an exponential variable with mean l"^. After time ^' select a half-edge from L uniformly 
at random, say h^. Remove hi from L and match it to a uniformly chosen half-edge in the 
entire graph excluding L, say h. Add the new vertex (connected to h) to B and reveal 
the matchings (and weights) of any of its half-edges whose matched half-edge is also in 
B. More precisely, let d be the degree of this new vertex and 2x the number of already 
matched half-edges in B (including the matched half-edges hi and h). There is a total 
of m — 2x unmatched half-edges, m being the total number of half-edges of the random 
graph G. Consider one of the d — 1 half-edges of the new vertex (excluding h which is 
connected to hi); with probability {^ — l)/{nn — 22; — 1) it is matched with a half-edge in 
L and with the complementary probability it is matched with an unmatched half-edge 
outside L. In the first case, match it to a uniformly chosen half-edge of L and remove 
the corresponding half-edge from L. In the second case, add it to L. We proceed in the 
similar manner for all the d — 1 half-edges of the new vertex. 

Let B[a, t) and L[a, t) be respectively the set of vertices and the list generated by the above 
procedure at time t, where a is the initial vertex. Considering the usual configuration model 
and using the memoryless property of the exponential distribution, we have Bw{a,t) = B{a,t) 
for all t. To see this, we can continuously grow the weights of the half-edges hi, ... ,hi in L until 
one of their rate 1 exponential clocks fire. Since the minimum of i i.i.d exponential variables 
with rate 1 is exponential with rate i, this is the same as choosing uniformly a half-edge hi 
after time ^ (recall that by our conditioning, these i half-edges do not pair within themselves). 
Note that the final weight of an edge is accumulated between the time of arrival of its first 
half-edge and the time of its pairing (except edges going back into B whose weights are revealed 
immediately). Then the equivalence follows from the memoryless property of the exponential 
distribution. 

Note that Ta{i) is the time of the i-th exploration step in the above continuous-time ex- 
ploration process. Assuming L{a,Ta{i)) is not empty, at time Ta{i + 1), we match a uniformly 
chosen half-edge from the set L{a,Ta{i)) to a uniformly chosen half-edge among all other half- 
edges, excluding those in L{a,Ta{i)). Let Ft be the cj-field generated by the above process 
until time t. Given Ta{i -|- 1) — Ta{i) is an exponential random variable with rate Sa{i) 

given by Equation ^ wich is equal to \L{a,Ta{i))\ the size of the list consisting of unmatched 
half-edges in B{a,Ta{i)). In other words, 

(r,(i + 1) - Ta{t) I J-T4i)) = Exp(5,(i)), 

this is true since the minimum of k i.i.d. rate one exponential random variables is an exponential 
of rate k. 

Recall that la = min{i, Sa{i) = 0} < n — 1, and set Sa{i) = for all < i < n — 1. We 
now extend the definition of the sequence d{i) to all the values of i < n — 1, constructing a 
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sequence {da{i))^Zi which will coincide in the range i < la with the sequence da{i) defined in 
the previous subsection. We first note that in the terminology of the exploration process, the 
sequence {da{i))i<i^ can be constructed as follows. At time Ta{i + 1), the half-edge adjacent to 
the i + 1-th vertex is chosen uniformly at random from the set of all the half-edges adjacent to 
a vertex out-side B, and d{i + 1) is the forward-degree of the vertex adjacent to this half-edge. 
Thus, the sequence {d{i))i<i^ has the following description. 

Initially, associate to all vertex j a set of dj half-edges (corresponding the set of half-edges 
outside B and L). At step 0, remove the half-edges corresponding to vertex a. Subsequently, 
at step k < la, choose a half-edge uniformly at random among all the remaining half-edges; if 
the half-edge is drawn from the node j's half-edges, then set da{k) = dj — 1, and remove the 
node j and all of its half-edges. Obviously, this description allows to extend the definition of 
da{i) to all the values of Iq < i < n — 1. Indeed, if /a < n — 1, there are still half-edges at 
step Ia + 1, and we can complete the sequence da{i) for i G [la + l,n — 1] by continuing the 
sampling described above. In this way, we obtain a sequence (c?a(0)r=/ which coincides with 
the sequence defined in the previous section for i < la- 

We also extend the sequence Sa{i) for i > la thanks to ([6]). Recall that, we set Xa{i) = 
Xa{Ia) for all i > la- It is simple to see that with these conventions, the relation ([7]) is not 
anymore valid for i > la but we still have Sa{i) < Sa{i) — 2Xa{i) for all i. 

The process i i— )• Xa{i) is non-decreasing in i € [l,n — 1]. Moreover, given J-j'^(j), the 
increment Xa{i -|- 1) — Xa{i) is stochastically dominated by the following binomial random 
variable 

Xa{i + 1) - Xa{i) <st Bin (da{i + 1), , (IQ) 

where m^") = Yl^=i di- We recall here that for two real- valued random variables A and B, we say 
A is stochastically dominated by B and write A <st B if for all x, we have ¥{A > x) < ¥{B > x). 
If C is another random variable, we write A <st [B \ C) if for all x, ¥{A > x) < F{B > x\C) 
a.s. 

We have the following useful lemma. 
Lemma 2.2. For i < ^, we have 

V (■\ ^ T>- ( c io\ , • max£<jga(^) ^ 

Xa{i) <st Bm max 5a W + ^tt • (H) 

\ i<i n — 2i I 

A proof of this lemma is given in Section IA.4I Note that if i > la, then Sa{i) = and 
Xaii + 1) - Xa{i) = 0, so that 1^0^ is still valid. 

An important ingredient in the proof will be the coupling of the forward-degrees sequence 
{d{i)} to an i.i.d. sequence in the range i < that we provide in the next subsection. 

Recall that we defined On and /3„ as follows (c.f. Equation ^) 



an = log^ n, and /3n = 3\ — ^—rn log n 
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2.2 Coupling the forward-degrees sequence da{i) 

We now present a coupling of the variables {da{l), ■■■,da{k)} valid for k < f3n, where /3„ is 
defined in Equation ([5]), with an i.i.d. sequence of random variables, that we now define. Let 
An := maxjg[i „] dj. Note that by Condition ll.il (ii). we have A„ = 0{n^/'^~'^). 

Denote the order statistics of the sequence of degrees (d-"^) by 

"(1) - ^^(2) - - "(n) • VJ-^J 

Define m(") := Er=f" d^(^) and let vr^") be the size-biased empirical distribution with the f3n 
highest degrees in (fT2]) removed, i.e.. 



TT^ ^ 



d^") = yfc + 1 



Similarly, define m'^"-' := X^iL(/3„+i)A„ ^[r? ^'^'^ ^^"^^ '-'^ ^'^^ size-biased empirical distribution 



\i) 

with the (/3„ -|- 1)A„ lowest degrees in ()12p removed, i.e 



i:r={,„H-i)A„(^+i)i 



jjlin) 



Note that by Condition II. H we have (in^n = o{n) which implies that both the distributions 
vr^") and vf^"^ converge to the size-biased distribution q defined in Equation ([T|) as n tends to 
infinity. 

The following lemma shows that the forward-degree of the i-ih. vertex given the forward- 
degrees of all the previous vertices is stochastically between two random variables with lower 
and upper distributions vf^"^ and vr^"^ defined above, provided that i < Pn- More precisely. 

Lemma 2.3. For a uniformly chosen vertex a, we have for all i < Pn, 

d!^^ <st (da{i) I 4(1), ...,da{i- I)) <st , (13) 



where D^[^^ (resp. of^^ ) are i.i.d. with distribution n^'^''^ (resp. ■k^"'^). In particular, we have 
for all i < j3n, 



fc=l fc=l k=l 

A proof of this lemma is given in Appendix lA. 51 



3 Proof of the Upper Bound 

In this section we present the proof of the upper bound for Theorem 1 1.31 Namely we prove that 
for any e > 0, with high probability for all vertices u and v which are in the same component 
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(i.e., such that distw{u,v) < oo), we have 

disty,{u,v) < (^— r + T^TT — T)(l + e)logn, 

\lJ-l T [dram) J 

where T{drain) is defined in ([3]). 

Recall that we defined an = (logn)^ and /3.„ = Sy^^j^n log n. The proof will be based on 

the following two technical propositions. For the sake of readability, we postpone the proof of 
these two propositions to the end of this section. 

The first one roughly says that for all u and v, the growing balls centered at u and v 
intersect w.h.p. provided that they contain each at least (3n nodes. More precisely. 

Proposition 3.1. We have w.h.p. 

distw{u,v) < Tu{f3n) + T^{l3n), for all u and v. 

The above proposition shows that in proving the upper bound, it will be enough to control 
the random variable T„(/3„) for each node u in V . It turns out that in the range between a„ 
and /3„, in the three cases d^ain ^ 3, d^am = 2, and d^iin = !> Tu{k) have more or less the same 
behavior, namely, it takes time at most roughly half of the typical (weighted) distance to go 
from size a„ to /3„. More precisely. 

Proposition 3.2. For a uniformly chosen vertex u and any e > 0, we have 
P (^r„(/3„) - Tu{an) > ^^^l^l^"^'' I In > an) = o{n-'). 

The conditioning > On is here to ensure that the connected component which contains 
u has size at least a„. In particular, note that one immediate corollary of the two above 
propositions is that two nodes whose connected components have size at least an are in the 
same component (necessarily the giant component), and that the two balls of size /3n centered 
at these two vertices intersect w.h.p. 

Using the above two propositions, we are only left to understand T„(a„), and for this we 
will need to consider the three cases separately. Before going through the proof of the upper 
bound in these three cases, we need one more result. Consider the exploration process started 
at a vertex a. We will need to find lower bounds for Sa{k) in the range 1 < A; < an- Recall 
that we defined 7a (/c) as the number of nodes of forward-degree at least two in the growing 
balls centered at a, c.f. Equation ([8|) for the precise definition. These nodes are roughly all the 
ones which could contribute to the growth of the random variable Sa{k). Now define the two 
following events. 

Ra ■■= {Saik) > daiin + 7a{k), for all < A; < a„ - 1 } , 
R'^ := {Sa{k)> ^a{k), foi allO<k< an -1}. 

Lemma 3.3. Assume da > 2 and da{i) > 1 for all 1 < « < a^. Then we have 

P(i?J 4(l),...4(n- 1)) > l-o(Wn/n), (14) 
P(Kl^a(l),...4(n-l)) > l-o{n-'^'^). (15) 

In particular, P(i?a) > 1 - o(log^°n/n) and f'{R'a) > 1 - o(n-^/2)_ 
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Proof. Since da{i) > 1, Saik) is non-decreasing in k. We have for all k ^ ct^, 

dmin + la{k) < da + -fa{k) < Sa{k) < OnAn = o(n), (16) 

and moreover, maxfc<Q,^ Sa{k) = Sa{oin)- Since da > 2 and Sa{k) = Sa{k) — 2Xa{k), we have 

{Xaian) = 0} C Ra, {X,(q„) < 1} C K- 

Note that the inequalities in (jl6p are true for any sequence such that 1 < da{i) < A„. In 
particular, in the rest of the proof we condition on a realisation of the sequence 

d = {da,da{l), . . . da{n - 1)). 

We distinguish two cases depending on whether or not Sa{an) is smaller than Sa^- Denote 
this event by Q (and its complementary by Q'^), i.e., 

Q:= { Sa{an) < 3an }• 

• Case 1) Sa{ctn) < "ioin- Conditioning on Q, by Lemma 12.21 we have 

3q„ 



Thus, we have 



Xa{an) <st Bin 4a„, 

n — lUn 



'Xa{an)>l\QA)< P(Bin(4a„,^^)> ij < 0{al/n), 
Xa{an)>2\Q,d) < p(Bin(4a„,^^) > 2) <0{ai/n^). 



We infer that 



¥[{Rar\Q,d) < 0[al/ 
P(«riQ,d) < 0{ai/ 



n] 



• Case 2) S'a(a„) > 3a„. Note that in this case, we still have maxfc<Q,,j Sa{k) = 5a(a„) < 
a„A„ = o(n). Moreover, there exists k < On such that for all I < k, Sa{t) < 3a„, and 
Sa{k + 1) > 3an- Note that since we have conditioned on the degree sequence d, the 
value of k is deterministic (k is not a random variable). Conditioning on the event Q^, 
we obtain by Lemma 12.21 

Xa{k) <st Bin (^4a„, -^^^ , and (17) 

Xaion) <st Bin ( an(A„ + 1) 



n — 2an 
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By Condition 11.11 (ii). there exists a e > such that A„ := ©(n^/^"*^). Let m = [2e~-'^]. 
Combining the last (stochastic) inequality together with the Chernoff 's inequality applied 
to the right-hand side binomial random variable, we obtain 

¥{Xa{an)>m\Q',d) < p (^Bin (^a„(A„ + 1), -^^^^ > 

= 0{{Alal/nr)=o{n--'). 

We notice that for all £ > A;, we have Sa{i) > 2an — 2Xa(a„). Also for n large enough, 
we have 2an — 2m > d^i^ + 7a (^)- Therefore, 

{Xa{k) = 0, Xa{an)<m, Q'} C RaDQ'^, and 
{ Xa{k) < 1, Xa{an) < m, } C Kn 

This in turn implies that 

¥{{Rar\Q^,d) < F{Xa{k)>l\Q^)+F{Xa{an)>m\Q^) <0{al/n) 
P(«)^|Q^d) < F{Xa{k)>2\Q^)+F{Xa{an)>m\Q') <0{ai/n^). 

In the above inequalities, we used (stochastic) Inequality (fT7|) and Case 1 to bound the 
terms P (^^(A;) > 1 | Q^^) and ¥{Xa{k) > 2 | Q"). 

The lemma follows by the definition of □ 

We are now in position to provide the proof of the upper bound in the three different cases 
depending on whether dmin > 3, d^iri = 2, or dmin = 1- Recall, for the ease of reading, the 
definition of the two events Ra and R'^ that we will use throughout the proof below. 

Ra = {Sa{k) > dmin + 7a(^), for aU < /c < a„ - 1 } , 
K = {Sa{k) > 7a{k), for aU < < a„ - l}. 

The proof will be based on the analysis of these events. In particular, to justify that we 
have to consider these three different cases, we note that in the case dmin > 3, the value of 7a (A;) 
is always k, while in the case dmin = 2 we need to control the length of the paths consisting of 
the vertices of degree two which contribute to the value of 7a (A;), and in the case dmin = 1 we 
need to do a kind of similar analysis as in the case dmin = 2 but in a modified configuration 
model which consists of the 2-core of the graph. We also emphasize that one other important 
difference between the case dmin ^ 3 and the two other cases dmin = 1,2 is that in the former 
case, as we will prove, the graph is connected with high probability, while in the two later cases 
c^min = 1)2 we also need to consider the small components of the 2-core. In addition, in the 
case dmin = 1 we need to consider the vertices which are connected to the small components of 
2-core and also, the tree components. 

In the following, we will use the following property of the exponential random variables, 
without sometimes mentioning. IfY is an exponential random variable of rate n, then for any 

e < fi, we have E [e^^] = 
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3.1 Proof of the upper bound in the case d^am ^ 3. 

Let a be a node of the graph. Consider the exploration process defined in Section 12.11 First, 
note that in this case, the conditions da{i) > 1 of Lemma 13.31 are automatically verified. Thus, 
as an immediate corollary, we obtain 

Corollary 3.4. We have P(/a > an) > 1 - o(n-3/2). 

Proof. Indeed, for dmia > 3, we have 7a (A;) = A; so that 

K Q {la > an} = {Sa{k) > 1, for all < A: < a„ - 1}. 

Now apply Lemma 13.31 □ 

We will need the following lemma. 

Lemma 3.5. For a uniformly chosen vertex a, and any e,i > 0, we have 

^{Ta{an) >elogn + e) =0(^-1+6"'^--^). 

Proof. Recall that given the sequence Saik), for k < la, the random variables Ta{k + 1) — Ta{k) 
are i.i.d. exponential random variables with mean Sa{k)~^. 

Assume first that R'^ holds and consider the following two cases based on whether the event 
Ra holds or not. 

• Case 1) Ra holds. By the definition of Ra, we have Sa{k) > (imin + k for all A; < — 1. 
Conditioning on Ra, we have for any k < an, 

Ta{k + 1) - Ta{k) <st Yk = Exp (dmin + A:) , 

and the random variables are all independent. Hence, we have 



E 



< 



n 1+ 



dr, 



k 

Ctn — 1 



drain ^ ^ 



fc=l 



k=l 

< exp 

< a^min = (logn)^'^™'", 

for n large enough. Thus, by Markov's inequality, we have for any e > 0, 

F{Taian)-Ta{l)>elogn + i\Ra) < (logn)3"'-'"exp(-d„,i„(elogn + £)) 

(log n)'^'^™'" 



o e 



By assumption (since Ra holds), S'a(O) > dmm, and so we also have Ta{l) <st Exp((in 
Therefore, 

P(ra(l)>6logn + £|i?a)<'''P^"^'^"^''^^ 



n 
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and we conclude 

P(ra(a„) > elogn + £| i?,) = o(e-'^-'-^). 

Case 2) Ra does not hold. In this case, for any k < an, by conditioning on {RaYr\R'^, 
we have 

Ta{k + 1) - Ta{k) <st Yk = Exp{k), 

and all the Y^'s are independent. We have 



E 



g(T4an)-T.{2)) \RC^nR', < n + - 1)) 

fc=2 



n. 

k=2 

By Markov's inequality, we have 

f{Ta{an)-Ta{2)>elogn + l\RlnR'^) < log^ n exp(-e log n - = o(n-^/2). 

We also have Ta(l) <st Exp(l) and Ta(2) — ra(l) <st Exp(l), and we conclude in this 
case 

r{Ta{an)>elogn + e\RlnK) =o(n-^/2). 

Putting all the above inequalities together, we have 

¥{Ta{an)>elogn + i) < 1 - P«) + (1 - P(i?<,))n-^/2 ^ 0(6"'^-"^) 

as desired. □ 
We can now finish the proof of the upper bound. By Proposition 13. H we have (w.h.p.) 

flood^(G(n, (dj)")) = max{dist„,(a, 6), 6 G y, disttt,(a, 6) < 00} (18) 

< Ta {f3n A /a ) + Hiax Tb{f3n/\Ib), and 

b 

diamu,(G(n, (dj)")) = max{dist^(a, 6), a, 6 G y, dist„,(a, 6) < 00} (19) 

< 2maxr„(/3„ A/J, 



where a is chosen uniformly at random in (|18p . 

By Proposition 13.21 and Corollary 13. 4| and Lemma [3 . 5 1 applied to £ = elogn, we obtain that 
for a uniformly chosen vertex a and any e > 0, we have 



r(Ta{(3n) > -^^ (l + e) logn + 2elognj = o(l). 



(20) 
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Indeed the above probability can be bounded above by 



P(?;(a„) > 2elogn) + P ( r„(/3„) - r„(a„) > 2(uVi) ^°g^ I 4 > ) + < ar, 



and this is o(l) by the above cited results. 



Furthermore, by Proposition 13.21 and Corollary 13.41 and Lemma 13.51 applied to ^ = we 



obtain that for a uniformly chosen vertex b and any e > 0, we have 

Indeed the above probability can be bounded above by 

Tb{an) > ^ logn ) + P [Tbi/Sn) - n{an) > 1^^^ logn | h > ) + Hh < 

dmin y V 2(1/ - 1) J 

and this is o(n~^) by the above cited results. 

Applying Equation (j2ip and a union bound over 6, we obtain 



V6, W„)<(^ + ^)(l + .)log„)=l-o(l). (22) 

We conclude by Equations ([l8]), ([20]), and ([22]) that w.h.p. (for any e > 0) 

fiood^(G(n,(dO^)) ^ 1 ^ 1 . 

log n ~ u — 1 d ■ 



mm 



Clearly, the two equations ([T9|) and (p2|) imply the bound on the diameter, that w.h.p. 
diam^(G(n,(dO?)) ^ , 1 , 2 

log n ~ ^U-l drain 

The proof of the upper bound in this case is now complete. 

We end this section by the following remark on the connectivity of the random graph. 
We note that the above arguments show that the graph G{n, (dj)"), satisfying Condition 11.11 
and dmin ^ 3, is connected w.h.p. Indeed by Lemma 13. 3^ R'^ holds with probability at leat 
1 — o(n~^). With probability 1 — o{n~^), for a uniformly chosen vertex a, we have Sk{a) > 1 
for all 1 < /c < On- By a union bound argument, with probability 1 — o(l) the size of the 
growing ball centered at node a reaches a„ for all nodes a in the graph. Using Lemma 13.101 
shows that for all nodes, this cluster also reaches The connectivity then follows by applying 
Proposition 13. 1[ 



3.2 Proof of the upper bound in the case d^am = 2 

Consider the exploration process defined in Section [2.11 starting from a. Recall the definitions 
([8]) and 7a (^) is the number of nodes with forward-degree (strictly) larger than one until 
the i-th exploration step, and Ta{k) is the first time that the k-ih node with the forward-degree 
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(strictly) larger than one appears in the exploration process started at node a. We also define 
the sets 

La{k) := {i, Ta{k)<Ta{e+l)<Ta{k + l)}, 

for A; > 0, and let na{k) be the smallest I in La{k). Clearly, we have na{k) > k — 1 and 

j-\k) = La{k) = [na{k),na{k + 1) - 1]. 

For X, y S M, we denote x Ay = min(2;, y). We will need the following lemma, equivalent to 
Lemma 13.51 (in the case dmin > 3) . 

Lemma 3.6. For a uniformly chosen vertex a, any x > 0, and any i = O(logn), we have 
' Ta{an A la) > X log n + A <o(n-i) + o(e-2(i-'?i)^). 



Ta{an) 



Proof. Recall that given the sequence Sa{k), for k < la, the random variables Ta{k + 1) — Ta{k) 
are i.i.d. exponential random variables with mean Sa{k)~^. First write 

= Yl Ta{3 + l)-TaU) 
0<j<an 

< T„(A; + l)-r,(A:), 



fc<A'„ 



(23) 



where Kn is the largest integer such that na{Kn) < an- 
We now show that for any x > and (. = O(logn), 

^[Ta{an)>x\ogn + ^,Ra) =0(6-2(1-"!)^). 

Remark that a sum of a geometric (with parameter vr) number of independent exponential 
random variables with parameter ji is distributed as an exponential random variable with 
parameter (1 — 7r)/i. For any k < Kn, we have: 

Ta{k + 1)-Ta{k)= J2 Ta{j + 1)-Ta{j) 

Assume Ra holds, then we have Sa{j) > 2 + A; for all j G [na{k),na{k + 1) — 1] = La{k). Thus, 

Ta{j + 1) - Ta{j) <st Yk,i ~ Exp(2 + k), 

where i = j — na{k) + 1, and all the Y^/s are independent, (for i = 1, . . . , \La{k)\, Y^^i are 
exponential random variables with rate 2 + k.) 
For any positive t and 6, we obtain 



Ta{an)-Ta{l)>t,Ra) < E 



E 



HRa) n e^(Wi)-T.W)|d„...,4(n-l) 



E 



l<k<Kn 



-et 



]j e^ntr'^y..^[R^\da,...,da{n-i] 

l<k<Kn 



< 



n h+ 



l<fc<Q„ 



{2 + k){l-Tr_^'^)-e 
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where in the last inequahty, we used the fact that the probability for a new node to have 
forward-degree one is at most vr["\ and so the length |La(A;)| is dominated by a geometric 
random variable with parameter vr^^^^ Taking 9 = 2(1 — vr^"^) in the above inequality, we get 

T„K)-T„(i) < n fi + f^)^"'--"'" 

= n (fc + 2)Ae-2(i-.("')t 

l<k<an 

In the same way, we can easily deduce that 

{Tail) I Ra) <st Exp(2(l-7rf))). 

Let t = X log n + and note that i < C log n for some constant C > (by assumption 
£ = O(logn)). Take any < e < x{l — qi){C + x)~^; since for n sufficiently large, we have 
TLi"^ < 91 + e, we obtain 



-2(l-gi)£ 



and ()23p follows. Note that x{l — qi — e) — eC > by the choice of e. 

Assume now that the event R'^ H R'^. holds. Two cases can happen: either < Q!„ or 
la > an- 

If la < On, then by the definition of R'^, = Sa{Ia) ^ la{Ia)-, i-e-, "Jaila) = 0. In other words, 
the component of a is a union of cycles (or loops) having node a as a common node, and with 
total number of edges less than a„. Hence, in this case, we have 



a;logn+^ 



Rl, la < an,Ta{Ia) > xlogn + i 

<p(i?^ |4,...,4(n-l)) I Yl (^^^)' / 
< l^i^(l-^W)-iexp(-(l-7r("^)(xlogn + £)) = o(n-^) 



where the last inequality follows from Inequality (jl4p in Lemma 137 
In the second case, when la > otn, let 

Q = R'ar\Rlr\{la>an]. 

If Q holds, by the definition of i?^, we have Sa{j) > k for all j £ La{k). Thus, 

Taii + 1) - Ta{j) <st Yk,; ~ Exp(A;), 
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where i = j — na{k) + 1, and all the Ifc^j's are independent, (for i = 1, . . . , \La{k)\, Y^^i are 
exponential random variables with rate k.) Hence, by the same argument as above, we have 



Ta{an) - Ta{2) > t, Q < E 



< E 



E 



1(Q) n e'^^-^'^'^-^^'''^^\da,...,da{n-l) 

2<k<Kn 



-et 



2<k<Kn 



-et 



2<k<a„ 



n 



where the last inequality follows from Inequality (I14p in Lemma 13.31 Thus, taking 9 = 1 — t^i 
gives 

T.K)-T.(2)>t,Q,)< n (l + ^)e-(--i"')*o^^°^"^ 

2<k<an 



in) 



n 



/log^ 



-( 

n I \ n 

Since da^^, we can easily deduce that 

{Ta{2) I Q) <st Exp(2(l-7rS"))) + Exp(l-^[")), 
with these two exponential being independent and independent of Q. Hence, we have 



Ta(2)>t| q) < jJ""2(l-vrS"))(e"(i"^i"')--e-2(i-^^"')-) <2e^ 



Thus, 



Ta(a„)>t,Q < e 



/log^ 



n 



Similar to the case where Ra holds (by fixing a constant e small enough and using that for n 
sufficiently large t^^^ < qi + e ioi n large enough) , we get 

log^^ n 



Ta{an)>x\ogn + ^,Q] < o 



n 



l+(l-gi-e)C 



o(n-i). 



Putting all the above arguments together, and considering the three disjoint cases {R'aY 
holds, Ra holds, and R'^r\Ra holds (in which case either la < an or > a„), we conclude that 

P ( Taian A /a) > X log 71 + ^ ) < 0(6-2(1-"!)^) + o{n-^) + 1 - F{R'J. 

To conclude the proof it suffices to use Lemma 13.31 □ 
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We can now finish the proof of the upper bound in the case (imin = 2. By Proposition [321 

log n 

2(i-gi)' 



and Lemma 13.61 applied to ^ = „.l°^" v , we obtain that for a uniformly chosen vertex a and any 



e > 0, we have 

p(^oo>r,(/3„A/J> + + log =o(n-i). (24) 

Indeed the above probability can be bounded above by 

Taian A la) > r log n + P (an) > TT, TT log"- \Ia>an 

2(1 -gi) J V 2(1/ -1) 

and this is o{n~'^) by the above cited results. 

Applying Equation (j24p (and Lemma |3.6|) and a union bound over a, we obtain 

Va, r,(/3„A/„)< (^-^-L_ + _l_^(l + e)logn^ = 1 - o(l). (25) 

Hence by Proposition 13. H we have w.h.p. 

diam^(G(n, {di)^)) ^ ^ e)(^— + ^ ) 
log n ~ z^— 1 \ — qi 

This proves the bound on the diameter. To obtain the upper bound for the flooding, we use 
Equation (p5]) . and proceed as above by applying Proposition 13.21 and Lemma [3?6] applied to 
H = e log n, to obtain that for a uniformly chosen vertex 6, we have 

niPnMb) < ( 2(!.t'i) +g)log^) = (26) 

Clearly, Equations and (f2U|) imply the bound on the flooding. 
The proof of the upper bound in this case is now complete. 



3.3 Proof of the upper bound in the case dmin = 1 

In this section, we will need some results on the 2-core of the graph. Basic definitions and 
needed results are given in Appendix I A. 2 1 

We denote by Ca the event that a is connected to the 2-core of G„, ~ G(n, (c?j)i ). It is 
well-known (c.f. Section [A.2p that the condition u > 1 ensures that the 2-core of Gn has size 
Q{n), w.h.p. We consider the graph Gn(a) obtained from G„ by removing all vertices of degree 
one except a until no such vertices exist. If the event Ca holds, Gn{o) consists of the 2-core of 
Gn and the unique path (empty if a belongs to the 2-core) from a to the 2-core. While, if the 
event holds, then the graph Gn{a) is the union of the 2-core of G„ and the isolated vertex 
a. 

In order to bound the weighted distance between two vertices a and h, in what follows, we 
will consider two cases depending on whether both the vertices a and h are connected to the 
2-core (i.e., the events Ca and Cf, both hold), or both the vertices a and h belong to the same 
tree component of the graph. In the former case, we will show how to adapt the analysis we 
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made in the case dmin = 2 to this case. And in the latter case, we directly bound the diameter 
of all the tree components of the graph. 

First note that G„(a) can be constructed by means of a configuration model with a new 
degree sequence d, c.f. Section lA. 21 with di > 2 for all i a. Consider the exploration process 
on the graph Gn{a) and denote by Ta{i) the first time the ball Bw{a,t) in Gn{a) reaches size 
i + 1. Also, la is defined similarly to la for the graph Gn{a). We need the following lemma. 

Lemma 3.7. For a uniformly chosen vertex a, any x > and any i = O(logn), we have 

¥(TaianAia)>xlogn + i^ < o(n-i) + o(e"(^-^*)^). 

Proof. First note that if Ca does not hold, i.e., if a is not connected to the 2-core, we will have 
la = (since da = 0), and there is nothing to prove. Now the proof follows the same lines as 
in the proof of Lemma 13.61 Note that conditional on Ca, we have da > 1, hence by Lemma 13.31 
we have f{Ra \ Ca, d) > 1 — o(log^'^ n/n), and similarly for R'^. The only difference we have to 
highlight here, compared to the proof of Lemma 13.61 is that conditional on Ra CiCa, we have 
Saij) > 1 + A; for all j £ La{k), where Sa{j) and La{k) are defined in the same way as Sa{j) 
and La{k) for the graph G{a). Take now 6 = 1 — in the Chernoff' bound, used in the proof 
of Lemma 13.61 where is defined as -Ky^' for the degree sequence (4"\...,4"^). The rest 
of the proof of Lemma 13.61 can then be easily adapted to obtain the same result provided we 
replace 2(1 — gi) by (1 — A*), which is precisely the statement of the current lemma. (Note that 
A* = qi, c.f. Section [A. 21 ) □ 



By Proposition 13.21 applied to the graph Gn{a) (note that i> = i/, cf. see Section [A.2p . and 

log n 



Lemma |3 . 71 applied to £ = , we obtain that for a uniformly chosen vertex a and any e > 



we have 



(27) 



Indeed the above probability can be bounded above by 

Ta{an A la) > log + P (^faiPn) " ?„(«„,) > ^^J^ Ua > "n 

and this is o(n~^) by the above cited results. 

Applying Equation [27] (and Lemma 13. 7p and a union bound over a, we obtain 

Va, TaWnAla) < ( ^_ + jA^) (l + e)logn^ =l-o(l). (28) 

To obtain the upper bound for the flooding, we use Equation [28l and proceed as above by 
using Lemma [3771 applied to £ = elogn, to obtain that for a uniformly chosen vertex b, we have 

nifinAh) < ( 2(l-'l) = (29) 



22 



Clearly, the two equations ([25]) and together with Proposition 13.11 (since Ta{k) > Ta{k) 
for all k), imply the desired upper bound on the (wighted) flooding and (wighted) diameter on 
the giant component of G„ and also on every components containing a cycle, i.e, connected to 
2-core. 

At this point, we are only left to bound the (weighted) diameter and the (weighted) flooding 
of the tree components. In particular, the following lemma concludes the proof. 

Lemma 3.8. For two uniformly chosen vertices a, b, and any e > 0, we have 
Y^-^^ogn < dist^{a,b) < oo, C^, = o{n~'^). 

Proof. We consider the graph Gn{a,b) obtained from G„ by removing vertices of degree less 
than two except a and b until no such vertices exist. As shown in Section \K.2\ the random 
graph Gn{a,b) can be still obtained by a configuration model, and has the same asymptotic 
parameters as the random graph G„(a) in the proof of the previous lemma. We denote again 
by d, the degree sequence of the random graph Gn{a, b). Also, Ta and la are defined similarly 
for the graph Gn{a,b). 

Trivially, we can assume da = 1 and di, = 1, otherwise, either they are not in the same 
component and so dist^(a, 6) = oo, or one of them is in the 2-core, i.e., one of the two events 
Ca or Cfe holds. Consider now the exploration process started at a until time k* which is the 
first time either a node with forward-degree (strictly) larger than one appears or the time that 
the unique half-edge adjacent to b is chosen by the process. Let v* be the node chosen at k*. 
Note that d^* = 1 if and only if the half-edge incident to b is chosen at k*. We have 

1 + e , 



1 - A. 



= r(^fa{k*[ 


)> 


1 + e 
1 - A. 


in,v* = 


b,da 


<p(f„(r: 


) > 


1 + e , 
1 - A. 


in,v* = 


b 1 d, 


= p(T„(r: 


) > 


1 + e , 
1 - A. 


in \ da 


= db 


p ( 


dv* 


= 1\ d^* 


^2,da- 


= db 



= o(n-2). 

To prove the last equality above, first note ¥{di,* = 1 \ d^* ^ 2,da = db = I) = O(^), this 
holds since v = v > 1 and v* will be chosen before o(n) steps, i.e., k* = o{n) (we will indeed 
prove something much stronger, that k* = O(logn), c.f. Lemma l4. II in the next section). And 

second, P (^Ta{k*) > logn | da = db = 1^ = o(l/n), this follows by the same argument 
as in the proof of Lemma [3771 applied to Gn{a, b), and by setting £ = (i+£H2II^;. □ 

The proof of the upper bound in this case is now complete by taking a union bound over 
all a and b. We end this section by presenting the proof of Proposition 13.11 and Proposition 13.21 
in the next subsection. 
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3.4 Proof of Proposition 13.11 and Proposition 13.21 

We start this section by giving some preliminary results that we will need in the proof of 
Proposition 13.11 and Proposition 13. 2i 



Lemma 3.9. Let D_["'^ be i.i.d. with distribution vr^'") . For any 7] < v, there is a constant 7 > 
such that for n large enough we have 



+ --- + D}^'' <kr]) <e-^^. (30) 

Proof. Let D* be a random variable with distribution ¥{D* = k) = qk given in Equation ([T]) 
so that E[D*] = u. Let (t){9) = E[e~''-^*]. For any e > 0, there exists ^0 > such that for any 
9 € (0, ^0)5 we have 



log 0(0) < {-u + e)e. 

yj=n-/3„+l "'(i) 



By Condition II . 1 1 and the fact that /3n^n = o{n), i.e., 11 d^^) = o{n), we have for any 



e>o, 

lim (^(")(0) = 0(0), 

n— >oo — 

where 0(")(0) = E[e-^^i"']. Also, for 9 > 0, 

P (^^W + . . . + D^^^ < nk^ < exp (^k [Ot] + log 0(") (0)) ) . 
Fix 9 < 9q and let n be sufficiently large so that log 0^"^ {9) < log cj){9) + e. This yields 

P (d^^ + • • • + D^^ < rjk) < exp {k {9ri + log 0(0) + e9)) 



< exp (/c0 (?7 - 1/ + 2e)) , 
which concludes the proof. □ 

The following lemma is the main step in the proof of both the propositions. 
Lemma 3.10. For any e > 0, define the event 



K ■■= !^Sa{k) > '^k, for all a„ < A: < /3„| . 



For a uniformly chosen vertex a, we have F{R'^\ Ia>an) > 1 - o{n-^). 

Before giving the proof of this lemma, we recall the following basic result and one immediate 
corollary, for the proof see for example \27\ Theorem 1]. 

Lemma 3.11. Let ni,n2 G N and pi,p2 G (0,1). We have Bin(ni,pi) <st Bin(n2,P2) if and 
only if the following conditions hold 

(i) ni < 722, 
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(ii) > (1-P2)"^. 

In particular, we have 

Corollary 3.12. If x < y = o{n), we have (for n large enough) 



X - Bin(x, y^x/n) <st y - Bm(y, y^y/n). 
Proof. By the above lemma, it is sufficient to show 

{x/nf/^ > {y/n)y/\ 

and this is true because is decreasing near zero (for s < e~^). □ 
Now we go back to the proof of Lemma 13.101 

Proof of Lemma \3.10l By Lemmas 12.31 and 13. 9^ for any e > 0, k > On and n large enough, we 
have 



da{l) + ... + da{k) < YTTJ^^j - ^ = '"^'^ >■ 

We infer that with probability at least 1 — o(n~^), for any k < I3n, 

^~^-k <da + 4(1) + ••• + da{k) - k <{k + 1)A„ = o(n). 



l + e/2 

By the union bound over k, we have with probability at least 1 — o(n~^) that for all an < k < Pn, 

" S < Sa{k) <{k + 1)A„ = o(n). (31) 



l + e/2 



Hence in the remaining of the proof we can assume that the above condition is satisfied. 

By Lemma [27T1 Corollary 13.121 and Inequalities (j3T]) . conditioning on Saik) and {la > k}, 
we have 



I/- 1 



>o+ —k — Bin ( uk, ^/ uk/n] . 

- l + e/2 V ' ^ 'J 



By Chernoff's inequality, since ky^k/n = o(/c/y^a^), we have 

P (^Bin(i/A;, ^uk/n) > k/^/E^^ < exp (^-^k/./a^^ = o{n~^). 
Moreover, conditioned on {la > k}, we have with probability at least 1 — o{n~^), 



' - l + e/2 Jd^ - 1 + e ' 
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for n large enough. Defining 

K{k) := I Sa{k) > I for a„ < < 

so that <' = nfla„ Kik), we have 

<(A:) I /a > fc) > l-o(n-6). (32) 



Thus, by using the fact that R'^{k — 1) C { /a > A; }, we get 

/ A. 

Ra\Ia>an) = 1 " P J ' ^ 

\k=an 

(Pn 
Ki^nY U U i^Kikf n i?::(A: - 1)) \Ia>an 
fc=a„+l 

> 1 - P KM U IJ n { > A; }) I /, > Q„ 

y fe=a„+l 

> 1- p(<(A;r I /a> A; 

fc=a„ 

> 1 - o(n-5), 

which concludes the proof. □ 

We are now in position to provide the proof of both the propositions. 

Proof of Proposition CO! Fix two vertices u and v. We can assume that Tu{f3n),T^{/3n) < oo, 
i.e., Iu,Iv ^ /3n) otherwise the statement of the proposition holds trivially for u and v. Note 
that distw{u,v) < Tu{f3n) +Ty{/3n) is equivalent to 

S^(n,T„(/3„)) n S^(^;,r,(/3„)) / 0. 

Hence, to prove the proposition we need to bound the probability that By^{v,Ty{(3n)) does not 
intersect By,{u,Tu{/3n))- 

First consider the exploration process for B.u,{u,t) until reaching t = Tu{(3n)- We know by 
Lemma 13.101 that with probability at least 1 — o(n~^), 

SuiPn) >{V-1- o(l))/3n. 

(In other words, there are at least {v — 1 — o(l))/3„ half-edges in Bw{u,Tu{/3n))-) 

Next, begin exposing B^{v, t). Each matching adds a uniform half-edge to the neighborhood 
of V. Therefore, the probability that Bw{v,Ty{/3n)) does not intersect Bu,{u,Tu{/3n)) is at most 

(z/-l-o(l))/3„V" 



d2 9A71 log n 



< exp[— (9 — o(l)) log n] < n 



for large n (recall that /3„ = — " ) • The union bound over u and v completes the proof. □ 
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Proof of Proposition VS. 'id Conditioning on the event i?^' defined in Lemma 13.11^ we have for 
any an<k< /3„, 



Ta{k + 1) - Ta{k) <st Yk ~ Exp{Sa{k)) <st Exp 



u - 1 
1 + e 



k], 



and all the Y^s are independent. 

Letting s = ^/a^, for n large enough we obtain that 



E 



^s{Ta{M-Taia„)) \ R'^ < Y[ [1 + 



k=an 



1+e 



< n 0^^) 



< exp 



< 



exp 



I/- 1 ^ A; 

fc=a,. 



s(l + 3e) log n 



2(^-1) 



By Markov's inequality, 



2(i.-l) 



. exp 



gS(Ta(/3„)-Ta(a„)) I 
s(l + 4e) log n 



2{u-l) 



< 



exp 



se log n 
'2(^^-1) 



+ o{n ^) = o{n 



which concludes the proof. 



□ 



4 Proof of the Lower Bound 

In this section we present the proof of the lower bound for Theorem 11.31 To prove the lower 
bound, it suffices to show that for any e > 0, there exists w.h.p. two vertices u and v such that 

dist^(u,?;) > (1 - e) ( + — - ) logn. 

As in the proof of the upper bound, the proof will be different depending whether dmin = 2, 
or > 3. So we start this section by proving some preliminary results, including some new 
notations and definitions, that we will need in the proof for these three cases, and then divide 
the end of the proof into three cases. 

Fix a vertex a in Gn ~ G{n, {di)^), and consider the exploration process, defined in Sec- 
tion [27TJ Recall that Ta(l) is the first time when the ball centered at a contains a vertex of 
forward-degree at least two (i.e., degree at least 3), c.f. Equation-Definition Q. To simplify 
the notation, we denote by Ca the ball centered at a containing exactly one node (possibly in 
addition to a) of degree at least 3: 

Ca:=BUa,Ta{l)). (33) 
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Note that there is a vertex u (of degree dy_ > 3) in Ca which is not in any bah B^{a,t) for 
t < Ta{l) and we have max^,gc'^ dist^(a, v) = dist^(a, u). We define the degree of Ca as 

deg(C7,) = da + du-2. (34) 

Remark that at time Ta(l) of the exploration process defined in Section [2.11 starting from a, 
we have at most deg(Ca) free half-edges, i.e., the list L contains at most deg(Ca) half-edges, 
(we have the equality if the tree excess until time ^^(l) is zero.) The following lemma shows 
that the size of Ca is relatively small. 

Lemma 4.1. Consider a random graph C{n, {di)'^) where the degrees di satisfy Condition \l.l[ 
There exists a constant M > 0, independent of n, such that w.h.p. for all the nodes a of the 
graph, we have |Ca| < Mlogn. 

Proof. We consider the exploration process, defined in Section 12.11 starting from a uniformly 
chosen vertex a, and use the coupling of the forward-degrees we described in subsection 12.21 
Recall in particular that each forward-degree d{i) conditioned on the previous forward-degrees 
is stochastically larger than a random variable with distribution vr'^"-'. This shows that, at 
each step of the exploration process, the probability of choosing a node of degree at most two 
(forward-degree one or zero) will be at most ttq""^ + vr^"^ < 1 — e, for some e > (note that 
the asymptotic mean of vr^") is i/, and by assumption i' > 1). We conclude that there exists a 
constant M > such that for all large n, 

P(|C7„| > Mlogn) = o{n-^). 

The union bound over a completes the proof. □ 

For two subsets of vertices U,W C V, the (weighted) distance between U and W is defined 
as usual, 

distw{U,W) := min{ dist^(n, w;) \ u £ U, w e W ] . 
For two nodes a, b, define the event 'Ha,b as 

'Ha,b ■= ^log?z < dist^(Ca,C'fe) < ooj . (35) 

Note that is the typical distance, so the left inequality in the definition of the above 
event means that Ca and Cb have the right typical distance in the graph (modulo a factor (1— e)). 
The right inequality simply means that a and b belong to the same connected component. The 
following proposition is the crucial step in the proof of the lower bound, the proof of which is 
postponed to the end of this section. 

Proposition 4.2. Consider a random graph G{n, (dj)'") with i.i.d. rate one exponential weights 
on its edges. Suppose that the degree sequence {di)1 satisfies Condition Assume that the 
number of nodes with degree one satisfy n^"^ = o(n), and let a and b be two distinct vertices 
such that deg{Ca) = 0{1), and deg{Ch) = 0(1)- Then for all e > 0, 

IP(^a,6) = 1-0(1). 
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Furthermore, the same result holds without the condition deg(Ca) = 0(1) if the node a is chosen 
uniformly at random and deg{Cb) = 0(1)- 

Assuming the above proposition, we now show that 

(i) If the minimum degree dmin > 3, then there are pairs of nodes a and b of degree dmm 
such that the event 7ia,b holds and in addition all the weights on the edges adjacent to a 
or b are at least (1 — e) log?i/dmin w.h.p., for all e > 0. 

(ii) If the minimum degree dmin = 2, then there are pairs of nodes a and b of degree two such 
that 'Ha,b holds and in addition, the closest nodes to each with forward-degree at least 
two is at distance at least (1 — e)logn/(2(l — qi)) w.h.p., for all e > 0. 

(Hi) If the minimum degree dmin = Ij then there are pairs of nodes of degree one such that 
Tia,b holds and in addition, the closest node to each which belongs to the 2-core is at least 
(1 — e)logn/(l — A*) away w.h.p., for all e > 0. 

This will finish the proof of the claimed lower bound. 



Proof of the lower bound in the case d^[^ > 3. Let V* be the set of all vertices of degree 

d„ 



drain- We Call a vertex u in V* good if is at least 4—^ logn, i.e., 



Tu{l) > ^logn, 

and if in addition, deg(Cu) < K for a constant K chosen as follows. Let be a random variable 
with the size-biased distribution, i.e., ¥{D = k) = q^. The constant K is chosen in order to 
have with positive probability D < dmm — I + K, i.e., 

y = yK:=F(^D< dmin " 1 + A^) > 0. (36) 

It will be convenient to consider the two events in the definition of good vertices separately, 
namely, for a vertex u £ V* , define 

£u := jT„(l) > i^lognl, and (37) 



:= {degiCu)<K}. (38) 

We note that in the case d^m ^ 3, the event £u for u £ V* is equivalent to having a weight 
greater than 4—^ log n on all the dmin edges connected to u, and clearly, the two above events 
f„ and £u are independent (conditionally on u € V*, i.e., du = (imin)- 

For u € V*, let be the event that u is good, := £ur\£^, and let Y be the total number 
of good vertices, Y := ^Au- the following, we first obtain a bound for the expected value 
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of Y, and then use the second moment inequahty to show that w.h.p. Y = Q{n''). 



E[Y] = ^ p(^,) = n£u).n£'u) 

u&V* ui 



u£V* 



(By the independence of £u and £'^) 
(Since F{£u) = P f Exp(d^i„) > ^ logn 



u£V* ^ 

= (l±o(l))p,„^^„nV 

In the equahty before last one, we used the coupHng argument described in Section [2^2] to bound 
the forward-degrees from above (and below) by i.i.d. random variables having distributions 
Tf^") (and TT^"^), and then used the fact that the asymptotic distributions of both vf^"') and vr^") 
coincides with the size biased distribution {qk}- This is where the factor (1 it o{l))y in the 
above equality comes from. The last equality is simply obtained from Condition 11.11 which 
implies that \V*\ = (1 it o(l))p(i^^^.^^n. 

We now show that Var(y) = o(E[y]^). Applying the Chebyshev inequality, this will show 
that Y > with high probability. 

We have 



E[Y^ 



E 



E 



E 



< E 



(E 1^")' 



E 



(K + 2)E[Y] + E 



u,veV*:Cur\Cv=0 



where to obtain the inequality above we used the fact that for each node u, when the event 
holds, the degree of C„ is at most K and so H C^, 7^ happens at most for {K + 2) nodes 
V. Indeed, each consists of v and another node of the graph. A simple analysis shows that 
such a V either belongs to Cu (two possibilities) or is among the neighbors of Cu (at most K 
choices). 

Moreover, for any pair of vertices u,v £ V*, such that C^nC^ = 0, conditioning on A„ does 
not have much effect on the asymptotic of the degree distribution. Indeed, by the coupling 
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argument of Section 12.21 we have for u,v £ V* such that n C„ = 0, 

¥{A, I Au) = I ^„)P(4 I Au) = nSvMK I Au) 

> P(Exp(dmin) > ^ logn) F(;D^"^ < - 1 + ) 

where D^"^ is a random variable with distribution 7f("\ Similarly, we have 

F{A, I A J < P(Exp((i^in) > ^ logn)P(^(") < d^i, - 1 + JC), 

Omin 

where D_^"'^ is a random variable with distribution vr^") . We infer that 

¥{A, I Au) = (l±o(l))y n-i+S 

and then using the estimate P(A^) = {l±o{l))yn''~^ we obtained above, we obtain (conditioned 
on CunC^ = 0) 

P(A n Au) = (1 ± o(i))P(A„)P(^^). 

This shows that 

< (i+o(i)) n^uMAv) 

u,v£V* 

= {i+o{m[Yf. 

Hence, we have 

Var[F] = E[y2] _ E[Yf < {K + 2)E[Y] + o{K[Yf) = o{K[Yf). 

This finishes the proof of the fact that Y > with high probability. 

We consider first the fiooding, and obtain the corresponding lower bound. Let Y' denote 
the number of good vertices that are at distance at most 4—^ log n + log n from a vertex a 

"mill ^ 

(chosen uniformly at random). It is clear that the lower bound follows by showing that Y' <Y 
with high probability, i.e., Y — Y' > w.h.p. To show this, we will bound the expected value 
of Y' and use Markov's inequality. 

Since by Condition II. H V* has size linear in n by applying Proposition 14.21 we obtain that 
for a uniformly chosen vertex u £ V*, conditioning on A^, we have P(^a,u) = 1 — o(l)- Indeed, 
the two events Ha^u and £u are independent, and conditioning on £^ is the same as conditioning 
on deg(Cu) < K = 0(1). Therefore, for a uniformly chosen vertex u in V*, we have 



E 



u,vev*:CunCv=$ 



¥{Aunni,)=o{F{Au)), 

where 'H'^^ denotes the complementary event of Tia^i, i.e., the event that Ha^i does not occur. 
Thus, a straightforward calculation shows that 

E[y'] = o(E[y]) = o(n'). 
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By Markov's inequality, we conclude that Y' < ^p^^.^^n'^ w.h.p., and hence Y—Y' is w.h.p. posi- 
tive. This implies the existence of a vertex u whose distance from a is at least ( + -r~ ) (1 ~ 
e) logn. Hence for any e > we have w.h.p. 

/I 1 \ 

flood„,(G„) > max dist^(a,ii) > 1 (1 — e)logn. 

ueV* dmin/ 

We now turn to the proof of the lower bound for the (weighted) diameter of the graph. The 
proof will follows the same strategy as for the flooding, but this time we need to consider the 
pairs of good vertices. Let R denote the number of pairs of distinct good vertices. Recall we 
proved above that w.h.p. Y > |lE[y]. Thus, 

The probabilities that u and v are both good and Tiu^v does not happen can be bounded 
as follows. 

F{A^nA,nni^,) = p(^, n a,)p(K,J A,^^,) 

= ¥{Au n A„)F {m^, I deg(C„) < K, deg(a) < K) 

(We used the independence of 'Hu,v and £u and S^) 
= o(P(A„nA)). (39) 

The last equality follows from Proposition 14.21 since Cu and are of degree 0(1). 

To conclude, consider R' the number of pairs of good vertices that are at distance at most 
(1 - e)(2|Sj^ + ^). By using Equation 1^, we have ER' = o(E[Y]'^). Applying Markov's 
inequality, we obtain that w.h.p. R' < i(E[y])2, and thus, R - R' is w.h.p positive. This 
implies that for any e > 0, we have w.h.p. 

/I 2 \ 

diam^(Gn) > max dist^(ii, v) > I h - — I (1 — e)logn. 

u,v£V* — 1 amin/ 

Proof of the lower bound in the case dmin = 2. Let V* be the set of vertices with degree 
two. Again we call a vertex u in V* good if both the events £u and £^ hold. Recall the definition 
of the two events 

£u ■■= |r„(l) > ^^^^^lognj, and (40) 
:= {deg(C7„) < i^} . (41) 
where in this case we choose K in such a way that 

y:=¥ (^D <l + K \ D >2^ > 0. (42) 

The random variable D has the size-biased distribution {qk}- For a vertex u of degree two, 
we denote as in the previous case by Au the event that u is good, and define Y := Ia^ to 
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be the number of good vertices. The strategy of the proof wih be the same as in the previous 
case, the details change. So we wih obtains bounds for the expected value and the variance of 
Y, and then use the second moment method to show that w.h.p Y = r2(n'^). The rest of the 
proof will be similar to the case dmm > 3 (based as before on the use of Proposition I4.2p . 

Consider the exploration process defined in Section 12.11 starting from a node u £ V*. At 
the beginning, each step of the exploration process is an exponential with parameter two (since 
there are two yet-unmatched half-edges adjacent to the explored vertices). In each step, the 
probability that the new half-edge of the list L does not match to the other half-edge of L 
(which of course corresponds to the case that u is not in the giant component) is at least 
1 — 1/n. This follows by observing that there are at least n yet-unmatched half-edges (by 
> 1), and by using Lemma l4.ll (which says that before Mlogn steps the exploration process 
meets a vertex of forward-degree at least two). By the forward-degree coupling arguments 
of Section 12.21 the probability that a new matched node be of forward-degree one is at least 
Tf^"'^ This shows that, with probability at least (1 — l/n)7f^"'^ the exploration process adds 
a new node of forward-degree one. This shows that the first step in the exploration process 
a vertex of forward-degree at least two is added will be stochastically bounded below by a 
geometric random variable of parameter (1 — l/n)7f^^\ Each step takes rate two exponential 
time. Therefore, 

¥{£^) = P (t^{1) > ^^[^ log7^) > P (^Exp (2 (1 - (1 - l/n)7rS"))) > ^^^^ logn) . 

In the last inequality we used the fact that a sum of a geometric (with parameter vr) number 
of independent exponential random variables of rate n is distributed as an exponential random 
variable of rate (1 — vr)/i. Note that this in particular shows that 

nSu) > (l-o(l))exp(-(l-e)logn) = {1 - o{l))n~^+' . 

By the coupling arguments of Section [2.21 (and by using Lemma [4.ip . similar to the previous 
case dmin > 3, we have 

¥{£:^\£u) = {l±o{l))y. 

We conclude that 

F{Au) = nK I £u)n£u) > (1 ± o(l))2/n-l+^ 
This shows, as before, that 

E[y]= ^ FiA^) > (l±o(l)) yp2n . 

uev* 

Moreover, for any pair of vertices u,v £ V* such that n Cy = 0, conditioning on does 
not have much effect on the asymptotic of the degree distribution (by Lemma 14.11 the size of 
each component C„ is at most Mlogn), and hence, we deduce again, by the coupling argument 
of Section 12.21 that for u and v such that Cu n Ci, = 0, 

F{A„ n Au) = (1 ± o{l))F{A^)F{A^). 
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We infer as before, 



Var(y) = E[y2] _ E[Yf 



E 



u,v<^V* 



E 



E 



u,vev*:CunCv^9 



u,vev*: c„nc„= 



- E[Y] 



El-. E 



1a„ + 



E 



1a„1a„ 



u,vev*:CunCv= 



E[Yf 



< {K + l){Mlogn)E[Y]+E 
= o{E[Yf). 



E 1-"!-^ 

u,vGV*:Cur\Cv=(D 



E[YY (By Lemma ST 



In the inequality above, we used Lemma 14.11 to bound w.h.p. the size of aU C«, by Mlogn 
(for some large enough M) for any node w in the graph, and used the fact that if the event 
Au holds, then there are at most K edges out-going from C^. Each of the vertices v with 
the property that n C„ 7^ should be either already on Cu or connected with a path 
consisting only of vertices of degree two to Cu (in which case, this path should belong to C„). 
A simple analysis then shows that the number of nodes v with the property that H C^, 7^ is 
bounded by {K + l)Mlogn, and the inequality follows. The rest of the proof follows by using 
Proposition 14.21 similar to the case dmin ^ 3. 



Proof of the lower bound in the case dmin = !• Consider the 2-core algorithm, and stop 
the process the first time the number of nodes of degree one drops below n^"*^/^. Let V* be 
the set of all nodes of degree one at this time. We denote by Gn{V*) the graph constructed by 
configuration model on the set of remaining nodes (this is indeed the l/*-augmented 2-core). 
Observe that proving the lower bound on the graph GniV*) gives us the lower bound on G„. 

Since \ V*\ = o{n/ logn), and the 2-core has linear size in n, w.h.p. the degree sequence of 
Gn(y*) has the same asymptotic as the degree sequence in the 2-core of G„ (see Section lA.2t 
Lemma lA. 21 for more details). In particular, we showed in Section [A. 21 that for the size-biased 
degree sequence of the 2-core's degree distribution, we have qi = A*, and for its mean, we have 
z> = z^. 

Repeating the coupling arguments of Section 12.21 and defining vf ^"^ (similar to the definition 
of Tf^")) for the degree sequence of Gn{V*), we infer that vfi'^"'' — > A*. 

Similar as before, call a vertex u in V* good if both the events £u and hold. Recall the 
definition of the two events 

£u ■■= |r„(l) > i^lognj, and (43) 
:= {deg{Gu)<K]. (44) 
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here the constant K > 2is chosen with the property that qx > {q is the size-biased probabihty 

mass function corresponding to the 2-core, c.f. Section [A.2p . 

Consider the exploration process starting from a node u £ V* . At the beginning, each step 

of the exploration process is an exponential of rate one, and the probability that each new 

—M 

matched node be of forward-degree exactly one is at least tti . Similar to the case of dmin = 2, 
we obtain 

¥{Au) > (l±o(l))gKlP(Exp(l-#S"^) > l-e(l-A,)logn) 

= (1 ± o{l))qK exp(-(l - e) ^ ^* log re) 

1 - TTi 

= (l±o(l))9~i^re-l+^ 

This shows that 

E[Y] = n^u) > n^~'l\\ ± o(l))gx n-i+^ = (1 ± o(l))g>n^/2. 

Similarly, we obtain that Var(y) = o(E[y]^), and the rest of the proof follows similarly to the 
precedent cases by using Proposition 14.21 for Gn^*)- Note that in the number of 

vertices of degree one is o(n) = o(|G„(y*)|) and thus, Proposition 14.21 can be applied. 
At the present we are only left to prove Proposition 14.21 

4.1 Proof of Proposition 14.21 

In this section we present the proof of Proposition 14. 2[ It is shown in [23^ [28] that the giant 
component of a random graph G(re, (cii)") for [di)\ satisfying Condition 11.11 contains w.h.p. 

all but o(n) vertices (since u > 1 and Ug"^ + u^"^ = o(n)). This immediately shows that 
P(dist^ (Ca, Cfo) < oo) = 1 — o(l). Define tn := 2{i7-i) So to prove the proposition, we 

need to prove that distw{Ca, Ch) is lower bounded by t„ w.h.p. in the case where deg(Cb) = 0(1) 
and either deg(Ca) = 0(1) or a is chosen uniformly at random. 

In the case where a is chosen uniformly at random, it is easy to deduce, by using Markov's 
inequality, that we have w.h.p. deg(Ca) < log re. Indeed, this is true since deg(Ca) is asymptot- 
ically distributed as (^D + D — 1 | D > 2^ , where D is a random variable with the size-biased 

distribution, and D is independent of D with the degree distribution {pk}- (To show this, one 
can use the coupling argument of Section [2?2] to bound deg(Ca) stochastically from above.) And, 
since this latter random variable has finite moment (by Condition II. ip . by applying Markov's 
inequality, we obtain w.h.p. deg(Ca) < log re. This shows that in both cases stated in the 
proposition, we can assume that deg(Cb) = 0(1) and deg(C'a) < log re. 

We now consider the exploration process defined in Section [2.11 starting from Oq, i.e., we 
start the exploration process with B = Ca, and apply the steps one and two of the process. 
In a similar way we defined Ta{i), we define Tc^(z^ to be the time of the i-th step in this 
continuous-time exploration process. Similarly, let dca{i) be the forward-degree of the vertex 
added at i-th exploration step for alH > 1, and define 

ScAi) ■■= deg(0,) + dcAl) + - + dcAi) " h (45) 
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and define Sca{i) similarly, so that we have Scai^) ^ 'S'ca(i)- Note that Tca{i) obviously 
satisfies 

Tc„(i + l)-rc„(i)=Exp(5c;„(i)) >st K,~Exp(5c„«), 

where the random variables Yi are all independent. 
Also, we infer (by Lemma l2.3p that 

i 

ScAi)<stlogn + Y,D^^^ -i, (46) 



where D^J^^ are i.i.d with distribution vr^"^ 

Let V^"'^ be the expected value of which is 



and define Zn = ^/n/\ogn. We will show later that the two growing balls in the exploration 
processes started from Ca and C;,, for a and h as in the proposition, will not intersect w.h.p. 
provided that they are of size less than Zn- We now prove that Tc^{zn) > tn. with high 
probability. 
For this, let us define 

k / i 

r'(A:)~^Exp log n + j;;Df^ - 

i=i \ j=i 

where all the exponential variables in the above sum are independent, such that by the above 
arguments, we have 

TCa{Zn) >st T'{Zn). 

We need the following lemma. (We define Exp(s) := +00 for s < 0.) 

Lemma 4.3. Let Xi,...,Xt be a random process adapted to a filtration = a[0],Fi,...,Ft, 
and let fii = EXj, = Xi + ... + Xi, Aj = ^1 + ... + Hi. Let Yi ~ Exp(Sj), and Zi ~ Exp(Aj), 
where all exponential variables are independent. Then we have 

Yi + ... + Yt>stZi + ... + Zt. 

Proof. By Jensen's inequality, it is easy to see that for positive random variable X, we have 

Exp(X) >,t Exp(EX). 

Then by induction, it suffices to prove that for a pair of random variables Xi, X2 we have 
Yi + Y2 >st Zi + Z2. We have 

F{Yi + Y2>s) = KxAnYi + Y2>s\Xi)] 

> Kx, [IP(Exp(Xi) + Exp(Xi + fi2) > s)] 

> F{Zi + Z2>s). 

□ 
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We infer by Lemma \4.'6\ 

T'{zn) J^Exp (logn+ (I7(") - l)i) =: T*{zn), 



i=0 



where all exponential variables are independent. 

We now let 6„ := logn — (l/^") — 1), so that we have 

J'E^i<t i=i 

J0<?/i<-<?;.„<t 

where = Yli=o ^zn-i- Letting y play the role of and accounting for all permutations 
over ui, ...,yz^~i (giving each such variable the range [0,y]), we obtain 

[Zn - 1)! 

JO yJ[o,y]^'i-i y 



1=1 

(l/W _ 1)^ 
i / V r -1 



where c > is an absolute constant. Recall that t„ = 2{u-i) ^og^i ^'^'^ -^n = a/jt-/ log n. Now 
1 — e"'-'^ ~ ) < e~" , for some a > and for all < y < t„. We 

infer 

tri 



Jo 

since 6„ = O(logn). Hence, we have w.h.p. 

(Here naturally, for C y, we let B^{W,t) = {b, such that distu,(W,6) < t}.) 
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Similarly for b, and exposing B^{Ch, tn), again w.h.p we obtain a set of size at most Now 
remark that, because each matching is uniform among the remaining half-edges, the probability 
of hitting Bw{Ca-,tn) is at most Sca{zn) /n. 

Let e„ := log log n. By Markov's inequality we have 

= — =0(1). 

We conclude 

p(s^(Ca,t„)nS„,(a,t„,) < F (^\B^{Ca,tn)\ > Z„) + P(|5^(a,t„)| > Zn) 

+ P (ScaiZn) > ZnCn^ + e„Z^/n = o(l). 

This completes the proof of Proposition 14.21 

5 The random graphs G{n,p) and G{n,m) 

We derive the results for G{n,p) and G{n,m) from our results for G(n, by conditioning 

on the degree sequence. Indeed, we can be more general and consider a random graph Gn 
with n vertices labeled [l,n] and some random distribution of the edges such that any two 
graphs on [1, n] with the same degree sequence have the same probability of being attained by 
Gn- Equivalently, conditioned on the degree sequence, G„, is a random graph with that degree 
sequence of the type G{n, (di)'^) introduced in the Introduction. We may thus construct G„ by 
first picking a random sequence {di)i with the right distribution, and then choosing a random 
graph G{n, (dj)") for this (dj)" . 

We assume that Condition 11.11 holds in probability: 

Condition 5.1. For each n, let d^*^) = be the random sequence of vertex degrees of 

(n) 

Gn and uj^ be the random number of vertices with degree k. Then, for some probability 
distribution {pr)^Q over integers independent of n and with finite mean fj, := 'Yl,k>QkPk ^ 
(0,oo), the following holds: 

(i) u~^^ jn A pk for every /c > 1 as n — )• cx); 

(ii) For some e > 0, Y.k=\ ^^+'^^1"^ = Op(n); 

The following lemma is similar to Lemma 8.2 in |22j . 

Lemma 5.2. // Condition I5.il holds, we may, by replacing the random graph Gn by other 
random graphs G'n with the same distribution, assume that the random graphs are defined on 
a common probability space and that Condition holds a.s. 
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Proof. If only Condition 1 1.1 I f i) was required, this lemma would be a direct consequence of the 
Skorohod coupling theorem (Theorem 3.30 [25]) for the random sequence ("u^"^)^! in the space 
M.^ . We now explain how to incorporate Conditions ll.lf ii) . Condition 15.11 implies that it is 
possible to find an increasing sequence Cj for i > 1 diverging to infinity so that considering 
the sets: 

oo oo oo ^ 

k=l k=l k=l } 

W^ ^ 4 .^ ^ T _ /'Oo•^-l ,,(") - t - 



we have for ah n, P(K O ^ ^j) > 1 - (2i)" (note that Efeli < = "-)• Let q) 

Qj+i — of^ — 1 ~ (^i)"""^ for all j > 1. For each £, we define an 

associated finite sequence: j for i = 1,...,A;(")(^) such that jj"^(£) = 1 and for i > 1, 

jil\{i)=min{j>jt\i), > Ti} if ^|)(,) < ^-m'' --difg|i^^^ >l-{2i)-\ 

we set A;(")(£) = i. Let j(")(£) = = ■ ■ ■ , j^^ll^^^ii)} ■ Note that, since ^ > 

1 - i2i)-\ we have k^^'^i) < I. 

We now explicitly construct a 'Skorokhod coupling'. Let be a uniform random variable 
in [0,1] and define the random variable j(")(^) by: = min{i E J^'^\t),e < qf"^} if 



9 < and if 9 > g^'J,^^^, we set j(")(£) = oo. We set jj")(£) = 0, jf ^(£) = oo for 

i > k^'^\i), ^0 = and ^oo = With these definitions, we have for all n and i > 1, 

P(JW(£) = jf •)(£)) = P (^(4")) G . 

For a given ^ and for any i > 1, we define the random variables u(")(f) = (it^"^(0)fceN S MJ^ 
having the law of ("ul"^) conditioned on the event {(4"^) ^ ^ ■(")ff\\^ ■(") /^^}• Note in particular 

that by construction, if i < A;(")(^), we have P (^(4"^) ^ ^^.(n)^^^\^^.(n) ^^^^ > (2£)-i, hence if 

there exist an infinite sequence of n such that i < fc("^(£), then we can apply the Skorohod 
coupling theorem and assume that, along this subsequence. Condition I l.ll fi) holds. 

We can now combine this coupling with the following one: given 9 taken uniformly at 
random in [0, 1], take i = [ 2(1-0) "I consider u^'^\j^'^\£)) which has the same law as the 
original u^'"). By construction, Condition ll.lf i) holds. Moreover, we have by construction 
jW(^) < £ since gj"^ > 1 - {2£)-^ > 9, so that u^''\j^''\£)) E Ai and Condition O^ii) holds. 

□ 

We now show that for G{n,p) and G{n,m), with np ^ fi £ (0, 00) and 2m/n — )• /i, 



Condition 15.11 holds with (p^) a Poisson distribution with parameter ^, i.e., pk = 
Indeed the fact that Condition IS-lT i) holds with such {pk) follows by elementary estimates of 
mean and variance done in Example 6.35 of [23]. Showing that Condition 15.1^ 11) holds can be 
done by similar arguments. Consider G{n,p) (a similar argument holds for G{n,m)), we have 
for all > and for n sufficiently large 
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Thus, X^fcli ^^^'EuP = 0(1), and Condition ED^ii) holds. 



Acknowledgements 

This paper is a part of the first author's PhD thesis. We are grateful to Remco van der Hofstad 
and Laurent Massoulie who reviewed the thesis, and made many valuable comments which 
helped to improve the presentation. We also thank Omid Amini for helpful comments on the 
first draft of the paper which helped to improve the readability. 



References 

[1] H. Amini and M. Lelarge. Large deviations for split times of branching processes. Preprint, 
2011. 

[2] K. B. Athreya and P. E. Ney. Branching processes. Dover Publications, 2004. 

[3] S. Bhamidi. First passage percolation on locally tree- like networks. I. dense random graphs. 
Journal of Mathematical Physics, 49(12):125218, 2008. 

[4] S. Bhamidi and G. H. R. van der Hofstad. First passage percolation on the Erdos-Renyi 
random graph. Available at http://arxiv.org/ahs/1005.4-104 , 2010. 

[5] S. Bhamidi, R. van der Hofstad, and G. Hooghiemstra. Extreme value theory, Poisson- 
Dirichlet distributions and first passage percolation on random networks. Advances in 
applied probability, 42(3):706-738, 2010. 

[6] S. Bhamidi, R. van der Hofstad, and G. Hooghiemstra. First passage percolation on 
random graphs with finite mean degrees. Annals of Applied probability, 20(5):1907-1965, 
2010. 

[7] B. Bollobas. Random Graphs. Cambridge University Press, 2001. 

[8] B. Bollobas and W. F. de la Vega. The diameter of random regular graphs. Combinatorica, 
2(2):125-134, 1982. 

[9] B. Bollobas, S. Janson, and O. Riordan. The phase transition in inhomogeneous random 
graphs. Random Structures and Algorithms, 31:3-122, 2007. 

[10] F. Chung and L. Lu. The average distance in a random graph with given expected degrees. 
Internet Mathematics, 1(1):91-113, 2003. 

[11] C. Cooper. The cores of random hypergraphs with a given degree sequence. Random 
Structures & Algorithms, 25(4):353-375, 2004. 

[12] J. Ding, J. H. Kim, E. Lubetzky, and Y. Peres. Diameters in supercritical random graphs 
via first passage percolation. Combinatorics, Probability and Computing, 19:729-751, 2010. 



40 



[13] D. Fernholz and V. Ramachandran. Cores and connectivity in sparse random graphs. 
Technical Report TR-04-13, The University of Texas at Austin, Department of Computer 
Sciences, 2004. 

[14] D. Fernholz and V. Ramachandran. The diameter of sparse random graphs. Random 
Structures and Algorithms, 31(4):482-516, 2007. 

[15] G. Grimmett and H. Kesten. First-passage percolation, network flows and electrical resis- 
tances. Probability Theory and Related Fields, 66:335-366, 1984. 10.1007/BF00533701. 

[16] O. Haggstrom and R. Pemantle. First passage percolation and a model for competing 
spatial growth. Journal of Applied Probability, 35(3):683-692, 1998. 

[17] R. van der Hofstad, G. Hooghiemstra, and P. V. Mieghem. The flooding time in the 
random graph. Extremes, 5(2):111-129, 2002. 

[18] R. van der Hofstad, G. Hooghiemstra, and P. V. Mieghem. Distances in random graphs 
with finite variance degrees. Random Structures and Algorithms, 27(1):76-123, 2005. 

[19] R. van der Hofstad, G. Hooghiemstra, and P. Van Mieghem. First-passage percolation on 
the random graph. Probability in the Engineering and Informational Sciences, 15:225-237, 
April 2001. 

[20] S. Janson. One, two and three times log n/n for paths in a complete graph with random 
weights. Combinatorics, Probability and Computing, 8(4):347-361, 1999. 

[21] S. Janson and M. Luczak. A simple solution to the k-core problem. Random Structures 
& Algorithms, 30:50-62, 2007. 

[22] S. Janson and M. J. Luczak. Asymptotic normality of the k-coie in random graphs. Annals 
of Applied Probability, 18:1085, 2008. 

[23] S. Janson and M. J. Luczak. A new approach to the giant component problem. Random 
Structures and Algorithms, 34(2):197-216, 2009. 

[24] S. Janson, T. Luczak, and A. Rucinski. Random graphs. Wiley-Interscience Series in 
Discrete Mathematics and Optimization. Wiley-Interscience, New York, 2000. 

[25] O. Kallenberg. Foundations of Modern Probability. 2nd ed.. Springer- Verlag, New York, 
2002. 

[26] H. Kesten. Aspects of first passage percolation. In Ecole d'ete de probabilites de Saint- 
Flour, XIV — 1984, volume 1180 oi Lecture Notes in Mathematics, pages 125-264. Springer, 
Berlin, 1986. 

[27] A. Klenke and L. Mattner. Stochastic ordering of classical discrete distributions. To appear 
in Advances in Applied probability, 2010. 

[28] M. Molloy and B. Reed. The size of the giant component of a random graph with a given 
degree sequence. Combinatorics, Probability and Computing, 7:295-305, 1998. 



41 



[29] O. Riordan and N. Wormald. The diameter of sparse random graphs. Combinatorics, 
Probability and Computing, 19:835-926, 2010. 

A Appendix 

A.l Branching process interpretation 

In this section, we give the intuition behind the introduction of the parameters A and A*, o 
Let Xq be a Galton- Watson tree with offspring distribution q given in Equation ([1]). Then A 
is the extinction probabihty of this branching process. Let C Xq be the set of particles of 
Xq that survive, i.e., have descendants in ah future generations. Then X^ contains the root of 
the original branching process with probability 1 — A, and is empty otherwise. We denote by D 
the random variable with distribution ¥(D = k) = qt- For r] S [0, 1], we let Dr^ be the thinning 
of D obtained by taking D points and then randomly and independently keeping each of them 
with probability r/, i.e., 

oo ^ ^ 

nD, = k) = Y,qr\^^y\i-r^y-K 

Note that the number of surviving children has the distribution Di_x- Let denote the 
offspring distribution in X^ . Conditioning on a particle being in X^ is exactly the same as 
conditioning on at least one of its children surviving, so that 

F{D+ = l) = P(L>i_A = l|^l-A > 1) 

Efc>ofcgfcA(l-A)^-i 
A 

= (j)'q{X) = K. 

Hence, the probability that the particles in generation k in X^ , consists of a single particle, 
given that the whole process survives, is exactly A^. This event corresponds to the branching 
process staying thin for k generations. 

A. 2 Structure of the 2-core 

The /c-core of a given graph G is the largest induced subgraph of G with minimum vertex 
degree at least k. The /c-core of an arbitrary finite graph can be found by removing vertices of 
degree less than k, in an arbitrary order, until no such vertices exist. The existence of a large 
k-core in a random graph with a given degree sequence has been studied by several authors, 
see for example Cooper |llj . Fernholz and Ramachandran jl3j . and Janson and Luczak |21j . 

Consider now a random graph G„ ~ G*{n, (dj)"). In the process of constructing a random 
graph Gn by matching the half-edges, the /c-core can be found by successively removing the 
half-edge of a node of degree less than k followed by removing a uniformly random half-edge 
from the set of all the remaining half-edges until no such vertices (of degree less than k) remain. 
What remains at this time is the k-coie. Since these half-edges are unexposed, the k-coie edge 
set is uniformly random conditional on the A;-core half-edge set. 
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Let /c > 2 be a fixed integer, and Core^"^ be the fc-core of tlie grapfi G„ G*{n, {di)^). We 
shall consider thinnings of the vertex degrees in G*{n, {di)^). Let D be the random variable 
with the distribution P(D = r) = pr, which is the asymptotic distribution of the vertex degrees 
in G*{n, {di)i). Recall that for < p < 1, Dp denote the thinning of D (obtained by taking 
D points and then randomly and independently keeping each of them with probability p) . For 
integers I > and < r < Z, let TTir denote the binomial probabilities 

ttUp) = P(Bin(/,p) = r) = Q/(l -p)'-. (47) 

Hence we have 

oo 

FiDp = r) = Y,Pmr{p)- (48) 

l=r 

We further define the functions 

oo oo 

h{p) := E[Dpl{Dp>k)]=^^rpi7rir{p), (49) 

r=k l=r 
oo oo 

hi{p) := F{Dp>k)=Y,Y.Pl''l-^P^- (50) 

r=k l=r 

Theorem A.l (Fernholz, Ramachandran [13] - Janson, Luczak [21]). Consider a random graph 
G{n, {di)i) where the degree sequence {di)i satisfies Condition \l.l[ Let k > 2 be fixed, and let 
Core^"^ be the k-core of G{n, (d^)"). Let p be the largest p < 1 such that fip^ = h{p). 

(i) If p = 0, i.e., if lip^ > h{p) for all p G (0, 1], then Gore"^^ has Op{n) vertices and Op{n) 
edges w.h.p. 

(a) If p > 0, and further suppose that p is not a local maximum point of the function h{p) — 
^p^ . Then 

v{Cove^^^^/n /ii(p) > 0, (51) 

oo 

V, (Corei")) /n ^ P {Dp = j) = Y,pm0), j > k, (52) 
e (^CoreJ^"^) /n ^ h{p) /2 = ^if /2. (53) 

Prom now on, we consider the case k = 2, and denote by G the 2-core of a graph G. In 
particular applying Theorem lA.il to the case k = 2, we have 

hip) := E [Dpi (Dp > 2)] 

= E[Dp]-F{Dp = l) 

= IJ-P Pi P{'^ - pf~^ 
I 

= pip{l-G,{l-p)). 
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Recall from Theorem lA.ll that we have to solve the equation = /i(j5), thus, we obtain 
1 — p = Gq{l — p), and so p = 1 — A. 

The graph G„ obtained from G„ has the same distribution as a random graph constructed 
by the configuration model (see e.g., [21]) on n nodes with a degree sequence d^^\ ...,d!'^^ 
satisfying the following properties. By Theorem lA.il 

n/n 4 /ii(l- A) := E[Z)i_a > 2] 

= 1 - Gp(A) - (1 - A)G;(A) 
= 1 - Gp(A) - /iA(l - A) > 0, 



and 



Jin) 



j}\/n 4 f^piQil-XyX'-^, j>2, 



e=j 

i 

It follows that the sequence {d^i \ d'^^} satisfies also the Condition 11.11 for some probability 
distribution with mean jl (which can be easily calculated from the two above properties). 

Let q be the size-biased probability mass function corresponding to p. We now show that 
q and q have the same mean. Indeed, denoting by P the mean of q, we see that z> is given by 



kqk = ■^'Yk{k - l)pk 
k ^ k 

Ek>2Hk-l)Ee>kPe{i){l-X)'X'-' 
Ml-A)2 

E.P^E.<^fc(fe-l)(D(l-A)^A^"^ 
Ml-A)2 



V. (54) 



To find the diameter in the case dmin = 1, we also need to show that qi 

2p2 2E,>2P.(2)(1-A)'A^-' 



2 



= 1g;'(A) = G;(A) = A,. (55) 

We will also need the following relaxation of the notion of 2-core. Let G = {V, E) be a 
graph. For a given subset W C.V^ define the M^-augmented 2-core to be the maximal induced 
subgraph of G such that every vertex in 1/ \ has degree at least two, i.e., the vertices in W 
are not required to verify the minimum degree condition in the definition of the 2-core. The 
W^-augmented 2-core of a graph G will be denoted by G(W). 
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It is easy to see that the VK-augmented 2-core of a random graph G„ ~ G*{n, (dj)"), denoted 
by Gn{W), can be found in the same way as the 2-core, except that now the termination 
condition is that every nodes outside of W must have degree at least 2, since the half-edges 
adjacent to a vertex in W are exempt from this restriction. The conditional uniformity property 
thus evidently holds in this case as well, i.e., for any subset W C V, the VF-augmented 2-core 
is uniformly random conditional on the VF-augmented 2-core half-edge set. We will need the 
following basic result, the proof of which is easy and can be found for example in [141 Lemma 
A.7]. 

Lemma A. 2. Consider a random graph G„ ~ G(n, {di)i) where the degree sequence {di)i 
satisfies Condition \ 1. 1[ For any subset W C V{Gn), and any w E W, there exists C > 
(sufficiently large) so that we have 

P {e{Gn{W)) - e{Gn{W \ {w})) < Clogn) = 1 - o{n'^). 

Note that the above lemma implies (by removing one vertex from at a time) that if \W\ = 
o(n/ log n), then w.h.p. the two graphs G„ and GniW) have the same degree distribution 
asymptotic. We end this section by giving now an overview of the proof. 

A. 3 Proof of Lemma [2.11 

To prove Lemma |2. 11 we need the following intermediate result proved in |14l Lemma 3.2]. 

Lemma A. 3. Let A he a set of m points, i.e., \A\ = m, and let F be a uniform random 
matching of elements of A. For e & A, we denote by F{e) the point matched to e, and similarly 
for X C A, we write F{X) for the set of points matched to X. Now let X C A, k = \X\, and 
assume k < m/2. We have 



\X n F{X)\ <st Bm{k, ^Jklm). 

Conditioning on all the possible degree sequences da(l),da(2), . . . , da{k), with the property 
that da + ^^KiKkdaii) = Sa{k), the configuration model becomes equivalent to the following 
process: start from a and at each step I < i < k, choose a vertex of degree da{i) + l uniformly 
at random from all the possible vertices of this degree outside the set {a, ai, . . . , aj_i}, choose 
a half-edge adjacent to Oj uniformly at random and match it with a uniformly chosen half-edge 
from the yet-unmatched half-edges adjacent to one of the nodes a, ai, . . . , aj_i. And at the end, 
after aj. has been chosen, take a uniform matching for all the remaining (m^") — 2k) half-edges. 
Now the proof follows from Lemma lA.3l by the simple observation that, since m^"'^ — 2k > n, 

Bin (^Sa{k), ^JSa{k)/m(-) -2k^ > x \ ^.(fc)^ < P (^Bin (^Sa{k), \/Sa{k)/n^ > x \ Sa{k)^ . 
A. 4 Proof of Lemma [2.21 

Given J^rp^f^^^, the increment Xa{i + 1) — Xa{i) is stochastically dominated by the following 
binomial random variable 

Xa{i + 1) - Xa{i) <st Bin (da{t + 1), ^^"^'^ " 



mW - 2{Xa{i) + i) 
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For i < |, we have 



(Saii) - 1)+ ^ Sa{i) - 2Xa{i) 



- 2{Xa{i) + i) ~ m(") - 2{Xa{i) + i) 

Sa{i) 



< 



< 



- 2i 
max£<j Sa{i) 
n — 2i 



Hence, we obtain for i < n/2 that 



Xa{t) <st Bm max Sa{i) + i, • 

\ £<i n — 2i I 

A. 5 Proof of Lemma [2.31 

Fix the sequence of degrees {d[^^} and the initial vertex a. We now prove that conditionally 
on the values of (da(l), ...,da{j — 1)), the random variable da{j) is stochastically smaller than 

Dj provided that j < j3n- This can be seen by a simple coupling argument as follows. In the 
following we will look at the half-edges adjacent to a node i as balls in a bin labeled with i, so 
the corresponding bin to the node i has di balls. First order the balls from 1 to m consistently 
with the order statistics. In other words, the values given to balls in each bin form an interval 
of consecutive numbers, and the values of the balls in the i-th bin is smaller than the values of 
the balls in the (i + l)-th bin for each i. 

Given the sequence (da, da{l), ...,da{j — 1)), choose uniformly at random a set of j — 1 bins 
containing respectively + 1, . . . , da{j — 1) + 1 balls and color in red all the balls of these 

— (n) 

bins. In order to get a sample for , pick a ball at random among all balls in the last 
n- ~ Wn + 1) bins and set Dj to be equal to the size of the selected bin minus one. We now 
define a random variable dj such that da{j) <st dj < -^j"^ obtaining the desired inequality. 

— (n) ~ — (n) 

If the ball picked in defining Dj is white, set dj = Dj . If there are red balls in the last 
IT- ~ (/3n + l)^n bins, and if the chosen ball is red, suppose that this ball is the ^-th ball among 
all the red balls in the last n — {13^ + bins for the induced order by the enumeration of 

the balls, and then define dj to be the size of the bin containing the £-th white ball minus one. 
Since da + da{'\.) + - • • + da{j — '^) < /3nA.„, this ball is in one of the first {fin + 1)A„ bins. In other 

— (n) 

words, in all cases we have dj < Dj . By the definition of dj and the definition of stochastically 

dominance, it is fairly easy to show that , da{j) <.st dj, conditioned on {da{l), da{j — 1)). 

Thus, we obtain one of the inequalities. A similar argument proves the other inequality 
D_j^^ <st dj conditioned on the sequence {da,da{l), ...,da{j — 1)), and so the first part of the 
lemma follows. 
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The second statement follows directly from the following basic result (whose proof can be 
found in [T3] Lemma 3.2). 

Lemma A. 4. Let Xi,...,Xt be a random process adapted to a filtration = cr[0], J-j, 
and let = Xi + ... + Xt- Consider a distribution ^ such that {Xsj^i\Ts) ^st (resp. 
{Xs-\-i\Ts) ^st fJ-) for all < s < t — 1. Then 'St is stochastically larger (resp. smaller) than 
the sum oft i.i.d. ^-distributed random variables. 
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